Skip to content

Detecting Biomarker for deaseases using the Next Generation Sequencing Data Set

Notifications You must be signed in to change notification settings

Kavinaya12/e17-co328-NGS-Data-AnalysingToolkit

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation


Next Generation Sequencing Data Analysing Toolkit


Table of Contents

  1. Introduction
  2. High Level Architecture
  3. Software Model
  4. Mechine Learning Model
  5. Use case Diagram
  6. Team
  7. Links

Introduction

Alzheimer’s disease (AD) has now been identified as the sixth most leading death cause in world. According to the Alzheimer’s Association, no cure has still been found for AD. there is even no way to weaken the progress of spreading the AD in patient’s body. It also mentions that the only available treatment in the medical field is for reducing its symptoms, like memory loss and confusion. Over the last few decades, doctors have used few methods like medical tests for checking memory and scans like positron emission tomography (PET) for scanning the patient’s brain. But, it is not possible to diagnose Alzheimer’s disease for certain from those methods. microRNAs (miRNAs) are small non coding RNAs which mainly help for regulating gene expressions. The potential of miRNAs as biomarkers for disease diagnosis has been emphasized by several researchers, and miRNA biomarkers can be recognized as a set of biomarkers which could benefit not only diagnosis procedure but also the treatments.

The development of next-generation sequencing (NGS) technology has resulted in a rapid growth in the synthesis of large genomic datasets of miRNAs,and we are developing user-friendly tools for finding biomarkers of genes and visualizing this data have not kept pace.

Our project is to develop a web application which can provide a user-friendly interface for Alzheimer disease prediction and finding biomarker genes for particular disease using differential expression analysis of human mRNA sequence data. We improve on available tools by offering a range of normalization, feature selection and classification methods and a simple to use interface.

High Level Architecture

Software design

Our application can be used to analyze the provided RNA-Seq datasets or users can upload their own human RNA-seq data for finding biomarkers for particular disease and disease prediction. Dataset should have the form that genes as rows and samples as columns which should include both control and AD samples. Files can be uploaded as a .csv or .xls file.

In our application, User can:

  1. browse a table of provided data or upload their own, then apply various normalization, feature selection algorithms, and classifiers to it.
  2. use the single field search bar, which can be used to find genes in the table.
  3. By selecting a gene from the table, the user can view a box plot that shows whether or not the sample has an impact on a specific disease.

Features of our application:

  • Tool will be keep stable and hosted online, independent for web browser, and not work with local installation.
  • High quality analysis tools should be packaged in a way that does not require expert knowledge of programming, but be accessed via a graphical user interface (GUI).
  • Users can choose from different data normalization, feature selection, classification methods, giving them more options for data analysis.
  • The output and results should be available in an easy-to-use format for data tables and plots.
  • The ability to select a good data processing pipeline and NO need for programming skills to put it in place.

UI Designs

LogIn/SignUp

Upload File

Data Visualise

Boxplot Visualise

ML Model

Use case Diagram

Developers

  1. E/17/015 Arshad MRM. [Email]
  2. E/17/230 Nishankar S. [Email]
  3. E/17/159 Kavinaya Y. [Email]

Scrum Master

  1. Mr. Imesh Ekanayake [Email]

Product Owner

  1. Dr. Damayanthi Herath [Email]

Links

About

Detecting Biomarker for deaseases using the Next Generation Sequencing Data Set

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • CSS 51.6%
  • JavaScript 26.0%
  • SCSS 19.0%
  • Jupyter Notebook 3.2%
  • Other 0.2%