Skip to content
@GenoRobotics-EPFL

GenoRobotics

Tools for on-field DNA analysis and plant species identification

🌱 GenoRobotics

myImage

🕵️Introduction

Biodiversity preservation has become a major challenge of the XXIst century. The classical conservation and taxonomy approach no longer fit with the emergency of the situation. To fill the gap between new genomic conservation approaches and DNA sequencing technologies, GenoRobotics is developing a miniaturized tool to automatically process biological samples into the field and accelerate biodiversity identification.

Initiated in 2018, GenoRobotics started as a joint project between EPFL alumni and botanists to develop field tools for an expedition in the primary rainforests East of Madagascar. This highlighted the need for a solution to perform on-site DNA analysis in harsh environment. Therefore, GenoRobotics' goal is to popularize the access to DNA analysis for field applications by developing a fully portable, rugged and automated DNA analysis solution including DNA extraction and sequencing.

To identify the samples it studies, Genorobotics relies on the principle of DNA barcoding: four specific genes are amplified in plant samples and sequenced using the portable Oxford Nanopore MinIon sequencer. By comparison to large databases of genetic sequences such as GenBank, the sample's species can be reliably inferred.

💻Bioinformatics

The Bioinformatics division of Genorobotics handles the genomic data generated from sequencing, turning raw DNA reads generated by the MinIon into a species prediction. This requires three main steps:

  • Consensus Generation: After a PCR (Polymerase Chain Reaction) is performed, the sample's gene of interest is amplified into thousands of near-identical fragments, all read by the sequencer. However, this process induces mutations, DNA breaks and inaccuracies, making the reconstitution of the original sequence from the reads challenging.
  • Species Identification: Once the consensus sequence is generated, it must be compared to large databases of genetic sequences to determine the species. The DNA alignment must thus be performed efficiently, and the results of the four alignments must be put together, resulting in one final species prediction
  • Database: raw reads, species predictions, sample images, collaborating scientists...all of this data must be stored somewhere! This team looks to create a database that can be used to share information between members of Genorobotics, and potentially share knowledge with other international DNA databases. Other steps of the pipeline are also optimized with bioinformatics, such as the design of universal primers that can bind to DNA independent of the species.

💪Main repos

🛠️ utils: general functions for dealing with raw fastq reads, calling bash commands, requesting genetic sequences online from GenBank, etc.

🧬 Consensus: as described above

🌿 Identification: as described above

🌐 Database: as described above

🤖 run-pipeline: combine all functionalities into notebooks that can be run on raw sequencing data

👨‍🏫 Training: Resources for students to get started in the project with resources on the underlying biology, basic bioinformatics, collaborative coding with Github, etc.

Pinned Loading

  1. run-pipeline run-pipeline Public

    Jupyter Notebook

  2. Consensus Consensus Public

    Python 1

  3. Identification Identification Public

    Python 2

Repositories

Showing 10 of 11 repositories
  • GenoRobotics-EPFL/Vladimir-project’s past year of commit activity
    Python 0 0 0 0 Updated Jun 19, 2024
  • GenoRobotics-EPFL/Identification’s past year of commit activity
    Python 2 0 3 0 Updated May 22, 2024
  • utils Public
    GenoRobotics-EPFL/utils’s past year of commit activity
    Python 0 0 0 1 Updated May 12, 2024
  • Database Public
    GenoRobotics-EPFL/Database’s past year of commit activity
    TypeScript 0 1 3 0 Updated May 8, 2024
  • Lemanic-Hackathon Public

    Genorobotic's participation to the Lemanic Life Sciences Hackathon of April 2024

    GenoRobotics-EPFL/Lemanic-Hackathon’s past year of commit activity
    Python 0 0 0 0 Updated Apr 29, 2024
  • GenoRobotics-EPFL/Consensus’s past year of commit activity
    Python 1 0 3 0 Updated Apr 23, 2024
  • GenoRobotics-EPFL/run-pipeline’s past year of commit activity
    Jupyter Notebook 0 0 0 0 Updated Apr 14, 2024
  • .github Public
    GenoRobotics-EPFL/.github’s past year of commit activity
    0 0 0 0 Updated Apr 11, 2024
  • Primer-Design Public

    Designing Universal Primers for PCR amplification of the barcoding genes in all plant species

    GenoRobotics-EPFL/Primer-Design’s past year of commit activity
    Jupyter Notebook 0 0 0 0 Updated Mar 7, 2024
  • Training Public

    Training material for both computer science and life sciences students joining the project.

    GenoRobotics-EPFL/Training’s past year of commit activity
    Jupyter Notebook 0 0 0 0 Updated Mar 6, 2024

Top languages

Loading…

Most used topics

Loading…