Skip to content
Tagged COVID-19 Biotechnology SARS-CoV-2 Life Science cancer CORONAVIRUS pandemic
BioXone

BioXone

rethinking future

March 7, 2026
  • About
  • BiotechTodayNews
    • IndiaWeekly Biotech News of India
    • WorldWeekly Biotech News of The World
  • DNA-TalesArticles
    • BiotechnopediaInteresting articles written by BioXone members and associates.
    • Scientists’ CornerArticles from the pioneers of Biotechnology.
    • Cellular CommunicationInterview of greatest researchers’ in the field.
  • Myth-LysisFact Check
  • Signalling PathwayCareer related updates
    • ExaminationsExamination related articles.
    • Job and InternshipJobs and Internship related articles.
  • Courses
  • Contact

Most Viewed This Week

October 17, 2023October 16, 2023

The Corrosion Prediction from the Corrosion Product Performance

1
October 1, 2023September 30, 2023

Nitrogen Resilience in Waterlogged Soybean plants

2
September 28, 2023September 28, 2023

Cell Senescence in Type II Diabetes: Therapeutic Potential

3
September 26, 2023September 25, 2023

Transgene-Free Canker-Resistant Citrus sinensis with Cas12/RNP

4
September 25, 2023September 25, 2023

AI Literacy in Early Childhood Education: Challenges and Opportunities

5
September 22, 2023October 1, 2023

Sustainable Methanol Vapor Sensor Made with Molecularly Imprinted Polymer

6

Search Field

Subscribe Now

  • Home
  • BiotechToday
  • Population-scale long-read sequencing and its approaches

ECMO: An artificial heart-lung set for COVID-19 treatment

Novel brain cells named “Gorditas" and "OPC" discovered

Population-scale long-read sequencing and its approaches
  • BiotechToday
  • World

Population-scale long-read sequencing and its approaches

BioTech Today June 22, 2021June 22, 2021

Sumedha B S, Bangalore University

Long-read sequencing (LRS), also called third-generation sequencing, offers a number of advantages over short-read sequencing such as Illumina’s NovaSeq, NextSeq, HiSeq and MiSeq instruments. Long-read sequencing technologies could permit the assembly of genomes, which is capable of revolutionizing genomics. It has the potential to reveal the full spectrum of human genetic variation, which would help in resolving some of the missing heritability also, leading to the discovery of new pathogenesis and mechanisms of diseases.

Currently, long-read sequencing technologies have reached a level of precision enabling application to variant detection in tens to thousands of samples. Advances in sequencing and bioinformatics have made it possible to achieve population-scale long-read sequencing. Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) are the two major competitors working towards innovation in this field. Statistical approaches in genetics help to identify variants that correlate with a certain phenotype like a disease. Population-scale sequencing is defined here as the sequencing of more than 5 genomes, but in the case of limited genomic diversity, a lower number of genomes are sufficient.

Previous population studies, including genome-wide association studies (GWAS), have not been able to exhaustively characterize the genetic factors underlying human traits and diseases. However, a significant proportion of hidden variants could be discovered with long-read sequencing. Recent LRS projects involving Icelandic and Chinese populations have identified hidden variants related to anaemia, height, cholesterol levels. LRS is beneficial for improving the range, continuity, accuracy of variant phasing, and assessment of small variants. This has been used to find disease-associated alleles.

The largest human-focused population-scale long-read sequencing study examined the genomic diversity of 3,622 Icelandic genomes. As part of the Human Pangenome project, LRS of a global cohort diversity is being carried out. Aside from human studies, long-read sequencing has been applied on a population scale to discover structural variation associated with phenotypes in fruit flies, songbirds and crops.

Structural variants (SVs) are genomic alterations that are 50bp or larger, it includes deletions, duplications, insertions, inversions and translocations.

Project strategy needed to be considered: The total number of sequenced individuals or chromosomes should be as high as possible.

Different approaches are:

 A full coverage approach– It is the most expensive of the three approaches. Highest level of resolution is obtained with this strategy. All the samples receive similar coverage and are equally well studied. It aims to sequence each sample of population with medium to high coverage. The advantage of this approach is the simplicity of study design, comprehensiveness, easy detection of rare variations and relatively straightforward workflow.

A mixed coverage approach– Here, a subset of samples, representative of the subgroups in the subpopulations are sequenced at high and the rest at low coverage. This approach is less expensive than the full coverage approach, and it achieves high detection sensitivity. It is suitable for studies with a limited budget or a high number of individuals. But there will be a bias towards common alleles with this approach, as many rare alleles may be missed.

A mixed sequencing approach– This involves LRS of just a few samples- 10-20% and short-read sequencing of the remaining.  The basis of this approach is similar to selecting individuals for high coverage in mixed coverage strategy.

Other approaches developed:

Sequencing logistics. It involves efficiently operating long-read sequencers, from logistics to sample preparation, loading optimizations and run monitoring. ONT and PacBio have different advantages. It also has its own challenges in almost every step due to the different designs of flow cells and sequencing instruments. An adequate amount of high molecular weight DNA and highly pure input DNA are required.

Analytical considerations the main challenge in population-level studies is a scalable and streamlined analysis. Two main strategies for downstream analysis: aligning reads from individual samples to a single reference genome and comparing de novo assemblies. These methods are significantly different in the computational and coverage requirements. That depends on the complexity and size of the genome.

Read alignment-based analysis. This is often the most common method of choice for population-scale studies. This enables the comparison of all samples with a reference genome and is the reason why more than half of population studies use this. These methods are less computationally demanding.

Population-scale de novo assemblies. These approaches are very sensitive and used to reconstruct diverse regions of the genomes. This can also lead to a collapse of highly similar segmental duplications. Algorithms that leverage single-nucleotide variants (SNVs) that differentiate multiple copies of repeats are used. The main challenge faced is the correct representation of the ploidy.

Graph genome methods. These allow the study of variants that are undetected by the current hi-tech short-read SV discovery methods. Tools, such as GraphTyper2100, Paragraph101 and tools from the vg package45,96, have been developed to graph genome structures.

Variant validation and genotyping. In this,any variants showing polymorphic genotypes are excluded. This approach neglects that some types of SV have higher mutation rates which are responsible for possible repeated mutations. It delivers the first step towards a more reliable SV genotyping. This method has recently been used for the corvids crows and jackdaws successfully.

The development of different approaches will have a profound impact on improved variant representation and complexity of the underlying biology. However, this would require a shift from a linear to a more complex form of the reference genome. PacBio and ONT are currently leading in the development of LRS for multiple applications. Other companies (such as, Base4, Quantapore, Omniome) are developing novel long-read approaches, whose accountability needs to be assessed in the coming years. This field is very rapidly developing the area of genomics and established tools quickly become outdated and are replaced by new ones.

Also read: The curious case of Covid-19 Re-infection

References:

  1. De Coster, W., Weissensteiner, M.H. & Sedlazeck, F.J. Towards population-scale long-read sequencing. Nat Rev Genet (2021). https://doi.org/10.1038/s41576-021-00367-3
  • The Corrosion Prediction from the Corrosion Product Performance
  • Nitrogen Resilience in Waterlogged Soybean plants
  • Cell Senescence in Type II Diabetes: Therapeutic Potential
  • Transgene-Free Canker-Resistant Citrus sinensis with Cas12/RNP
  • AI Literacy in Early Childhood Education: Challenges and Opportunities

Share this:

  • Share on Facebook (Opens in new window) Facebook
  • Share on X (Opens in new window) X

Related

Tagged Bioinformatics Biostatistics DNA sequencing Genetics genome-wide association studies genomics long-read sequencing population studies sequencing structural variants whole genome sequencing

One thought on “Population-scale long-read sequencing and its approaches”

  1. Pingback: The human-associated clade of S. suis. - BioXone

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Next Post
  • BiotechToday
  • World

Novel brain cells named “Gorditas" and "OPC" discovered

BioTech Today June 22, 2021

Mustafa Vora, DY Patil University Navi Mumbai Scientists have recently discovered two types of novel brain cells that are found to be glial cells. One of them is called a gordita, which is an astrocyte, a type of glial cells. The name ‘gorditas’ comes from the plum-shaped appearance of the squat and round cell bodies. […]

Related Post

  • BiotechToday
  • World

Epstein-Barr virus & its relation with COVID-19

bioxone August 1, 2021July 31, 2021

Nandini Pharasi, Jaypee Institute of Information Technology About EBV (Epstein-Barr virus) The Epstein-Barr virus (EBV), commonly known as human herpesvirus 4, is a herpes virus that belongs to the herpes family. It’s one of the most prevalent viruses in humans. EBV is a virus that may be found all over the world. EBV infects the […]

Share this:

  • Share on Facebook (Opens in new window) Facebook
  • Share on X (Opens in new window) X
  • BiotechToday
  • World

Mushroom mycelium used to make biodegradable tech wearable

bioxone March 4, 2021March 4, 2021

Anannya Roy, Amity University Kolkata Imagine the mushroom you ate for lunch is the same mushroom that you used in your Fitbit smartwatch for measuring the number of footsteps while jogging. Fascinating isn’t it? Science fiction has become a reality in today’s world and yes all this is possible. We learned in our high school […]

Share this:

  • Share on Facebook (Opens in new window) Facebook
  • Share on X (Opens in new window) X
  • BiotechToday
  • World

Can PRESENCEOF HYALURONAN be used as a treatment option in COVID19 patients?

bioxone October 7, 2020October 7, 2020

Arpita Adhikary, Amity University Kolkata Deterioration of the respiratory system mainly the lungs is prominent in SARS COVID-19. An active agent, a jelly-like substance was found in dead COVID 19 patients’ respiratory system.  A novel fact is established by the researchers as they conducted an experiment comparing the diseased lung tissue of COVID patients and […]

Share this:

  • Share on Facebook (Opens in new window) Facebook
  • Share on X (Opens in new window) X

Breaking News

The Corrosion Prediction from the Corrosion Product Performance

Nitrogen Resilience in Waterlogged Soybean plants

Cell Senescence in Type II Diabetes: Therapeutic Potential

Transgene-Free Canker-Resistant Citrus sinensis with Cas12/RNP

AI Literacy in Early Childhood Education: Challenges and Opportunities

Sustainable Methanol Vapor Sensor Made with Molecularly Imprinted Polymer

Exogenous Klotho as a Cognition Booster in Aging Primates

Terms and Conditions
Shipping and Delivery Policy
Cancellation and Refund Policy
Contact Us
Privacy Policy