Skip to content
Tagged COVID-19 Biotechnology SARS-CoV-2 Life Science cancer CORONAVIRUS pandemic
BioXone

BioXone

rethinking future

May 17, 2025
  • About
  • BiotechTodayNews
    • IndiaWeekly Biotech News of India
    • WorldWeekly Biotech News of The World
  • DNA-TalesArticles
    • BiotechnopediaInteresting articles written by BioXone members and associates.
    • Scientists’ CornerArticles from the pioneers of Biotechnology.
    • Cellular CommunicationInterview of greatest researchers’ in the field.
  • Myth-LysisFact Check
  • Signalling PathwayCareer related updates
    • ExaminationsExamination related articles.
    • Job and InternshipJobs and Internship related articles.
  • Courses
  • Contact

Most Viewed This Week

October 17, 2023October 16, 2023

The Corrosion Prediction from the Corrosion Product Performance

1
October 1, 2023September 30, 2023

Nitrogen Resilience in Waterlogged Soybean plants

2
September 28, 2023September 28, 2023

Cell Senescence in Type II Diabetes: Therapeutic Potential

3
September 26, 2023September 25, 2023

Transgene-Free Canker-Resistant Citrus sinensis with Cas12/RNP

4
September 25, 2023September 25, 2023

AI Literacy in Early Childhood Education: Challenges and Opportunities

5
September 22, 2023October 1, 2023

Sustainable Methanol Vapor Sensor Made with Molecularly Imprinted Polymer

6

Search Field

Subscribe Now

  • Home
  • BiotechToday
  • Population-scale long-read sequencing and its approaches

ECMO: An artificial heart-lung set for COVID-19 treatment

Novel brain cells named “Gorditas" and "OPC" discovered

Population-scale long-read sequencing and its approaches
  • BiotechToday
  • World

Population-scale long-read sequencing and its approaches

BioTech Today June 22, 2021June 22, 2021

Sumedha B S, Bangalore University

Long-read sequencing (LRS), also called third-generation sequencing, offers a number of advantages over short-read sequencing such as Illumina’s NovaSeq, NextSeq, HiSeq and MiSeq instruments. Long-read sequencing technologies could permit the assembly of genomes, which is capable of revolutionizing genomics. It has the potential to reveal the full spectrum of human genetic variation, which would help in resolving some of the missing heritability also, leading to the discovery of new pathogenesis and mechanisms of diseases.

Currently, long-read sequencing technologies have reached a level of precision enabling application to variant detection in tens to thousands of samples. Advances in sequencing and bioinformatics have made it possible to achieve population-scale long-read sequencing. Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) are the two major competitors working towards innovation in this field. Statistical approaches in genetics help to identify variants that correlate with a certain phenotype like a disease. Population-scale sequencing is defined here as the sequencing of more than 5 genomes, but in the case of limited genomic diversity, a lower number of genomes are sufficient.

Previous population studies, including genome-wide association studies (GWAS), have not been able to exhaustively characterize the genetic factors underlying human traits and diseases. However, a significant proportion of hidden variants could be discovered with long-read sequencing. Recent LRS projects involving Icelandic and Chinese populations have identified hidden variants related to anaemia, height, cholesterol levels. LRS is beneficial for improving the range, continuity, accuracy of variant phasing, and assessment of small variants. This has been used to find disease-associated alleles.

The largest human-focused population-scale long-read sequencing study examined the genomic diversity of 3,622 Icelandic genomes. As part of the Human Pangenome project, LRS of a global cohort diversity is being carried out. Aside from human studies, long-read sequencing has been applied on a population scale to discover structural variation associated with phenotypes in fruit flies, songbirds and crops.

Structural variants (SVs) are genomic alterations that are 50bp or larger, it includes deletions, duplications, insertions, inversions and translocations.

Project strategy needed to be considered: The total number of sequenced individuals or chromosomes should be as high as possible.

Different approaches are:

 A full coverage approach– It is the most expensive of the three approaches. Highest level of resolution is obtained with this strategy. All the samples receive similar coverage and are equally well studied. It aims to sequence each sample of population with medium to high coverage. The advantage of this approach is the simplicity of study design, comprehensiveness, easy detection of rare variations and relatively straightforward workflow.

A mixed coverage approach– Here, a subset of samples, representative of the subgroups in the subpopulations are sequenced at high and the rest at low coverage. This approach is less expensive than the full coverage approach, and it achieves high detection sensitivity. It is suitable for studies with a limited budget or a high number of individuals. But there will be a bias towards common alleles with this approach, as many rare alleles may be missed.

A mixed sequencing approach– This involves LRS of just a few samples- 10-20% and short-read sequencing of the remaining.  The basis of this approach is similar to selecting individuals for high coverage in mixed coverage strategy.

Other approaches developed:

Sequencing logistics. It involves efficiently operating long-read sequencers, from logistics to sample preparation, loading optimizations and run monitoring. ONT and PacBio have different advantages. It also has its own challenges in almost every step due to the different designs of flow cells and sequencing instruments. An adequate amount of high molecular weight DNA and highly pure input DNA are required.

Analytical considerations the main challenge in population-level studies is a scalable and streamlined analysis. Two main strategies for downstream analysis: aligning reads from individual samples to a single reference genome and comparing de novo assemblies. These methods are significantly different in the computational and coverage requirements. That depends on the complexity and size of the genome.

Read alignment-based analysis. This is often the most common method of choice for population-scale studies. This enables the comparison of all samples with a reference genome and is the reason why more than half of population studies use this. These methods are less computationally demanding.

Population-scale de novo assemblies. These approaches are very sensitive and used to reconstruct diverse regions of the genomes. This can also lead to a collapse of highly similar segmental duplications. Algorithms that leverage single-nucleotide variants (SNVs) that differentiate multiple copies of repeats are used. The main challenge faced is the correct representation of the ploidy.

Graph genome methods. These allow the study of variants that are undetected by the current hi-tech short-read SV discovery methods. Tools, such as GraphTyper2100, Paragraph101 and tools from the vg package45,96, have been developed to graph genome structures.

Variant validation and genotyping. In this,any variants showing polymorphic genotypes are excluded. This approach neglects that some types of SV have higher mutation rates which are responsible for possible repeated mutations. It delivers the first step towards a more reliable SV genotyping. This method has recently been used for the corvids crows and jackdaws successfully.

The development of different approaches will have a profound impact on improved variant representation and complexity of the underlying biology. However, this would require a shift from a linear to a more complex form of the reference genome. PacBio and ONT are currently leading in the development of LRS for multiple applications. Other companies (such as, Base4, Quantapore, Omniome) are developing novel long-read approaches, whose accountability needs to be assessed in the coming years. This field is very rapidly developing the area of genomics and established tools quickly become outdated and are replaced by new ones.

Also read: The curious case of Covid-19 Re-infection

References:

  1. De Coster, W., Weissensteiner, M.H. & Sedlazeck, F.J. Towards population-scale long-read sequencing. Nat Rev Genet (2021). https://doi.org/10.1038/s41576-021-00367-3
  • The Corrosion Prediction from the Corrosion Product Performance
  • Nitrogen Resilience in Waterlogged Soybean plants
  • Cell Senescence in Type II Diabetes: Therapeutic Potential
  • Transgene-Free Canker-Resistant Citrus sinensis with Cas12/RNP
  • AI Literacy in Early Childhood Education: Challenges and Opportunities

Share this:

  • Click to share on Facebook (Opens in new window) Facebook
  • Click to share on X (Opens in new window) X

Related

Tagged Bioinformatics Biostatistics DNA sequencing Genetics genome-wide association studies genomics long-read sequencing population studies sequencing structural variants whole genome sequencing

One thought on “Population-scale long-read sequencing and its approaches”

  1. Pingback: The human-associated clade of S. suis. - BioXone

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Next Post
  • BiotechToday
  • World

Novel brain cells named “Gorditas" and "OPC" discovered

BioTech Today June 22, 2021

Mustafa Vora, DY Patil University Navi Mumbai Scientists have recently discovered two types of novel brain cells that are found to be glial cells. One of them is called a gordita, which is an astrocyte, a type of glial cells. The name ‘gorditas’ comes from the plum-shaped appearance of the squat and round cell bodies. […]

Related Post

  • BiotechToday
  • World

Malnutrition & its long term effects on COVID-19 severity

bioxone July 31, 2021July 31, 2021

Anjali Kumari, IILM College of engineering and technology The patients who have a history with the diagnosis of malnutrition have an increased need for ventilation and risk of death when they test positive for the noble Coronavirus, as suggested in the new research. If there are no strict actions taken immediately, the ongoing pandemic will […]

Share this:

  • Click to share on Facebook (Opens in new window) Facebook
  • Click to share on X (Opens in new window) X
  • BiotechToday
  • World

What is your brain up to when you’re just walking?

BioTech Today April 9, 2022April 9, 2022

Sribas Chowdhury, Adamas University, Kolkata: Cognition is the brain’s ability to process any piece of information according to the perspective of a specific person. Different people have different sorts of cognitive abilities and specific neural patterns which lead to such cognition. While much research has been done to understand the relation of cognition with neural […]

Share this:

  • Click to share on Facebook (Opens in new window) Facebook
  • Click to share on X (Opens in new window) X
  • BiotechToday
  • India

Nuclei of each muscle fiber differ in terms of gene activity

bioxone December 16, 2020December 16, 2020

Camelia Bhattacharyya, Amity University Kolkata Since our school days, we have known Cells to contain a single nucleus. Muscle cells, on the other hand, are different. They are known for containing hundreds of nuclei inside a large cytoplasm.  Studies have shown that the gene activity of these nuclei differs from each other and can affect […]

Share this:

  • Click to share on Facebook (Opens in new window) Facebook
  • Click to share on X (Opens in new window) X

Breaking News

The Corrosion Prediction from the Corrosion Product Performance

Nitrogen Resilience in Waterlogged Soybean plants

Cell Senescence in Type II Diabetes: Therapeutic Potential

Transgene-Free Canker-Resistant Citrus sinensis with Cas12/RNP

AI Literacy in Early Childhood Education: Challenges and Opportunities

Sustainable Methanol Vapor Sensor Made with Molecularly Imprinted Polymer

Exogenous Klotho as a Cognition Booster in Aging Primates

Terms and Conditions
Shipping and Delivery Policy
Cancellation and Refund Policy
Contact Us
Privacy Policy