Representative image

Representative image
| Photo Credit: Getty Images

The story so far: The preliminary findings of the GenomeIndia project, which attempted to study whole genomes of 10,000 healthy and unrelated Indians from 83 population groups, were published in the journal Nature Genetics on April 8. After excluding two populations, the published findings are based on the genetic information of 9,772 individuals — 4,696 male participants and 5,076 female participants.

When was it launched?

The 10,000-human genome study was launched in January 2020 with funding from the Department of Biotechnology. Blood samples and associated phenotype data such as weight, height, hip circumference, waist circumference and blood pressure were collected from 20,000 individuals representing 83 population groups — 30 tribal and 53 non-tribal populations — spread across India. Of the 20,000 individuals, DNA samples from 10,074 individuals were subjected to whole genome sequencing, but later two populations were excluded.

Also read: What is ‘IndiGen’ project that is sequencing Indian genes?

The GenomeIndia project is a collaborative effort of 20 institutions. The genome sequencing was carried out by the Centre for Brain Research at IISc Bengaluru, the Centre for Cellular and Molecular Biology in Hyderabad, Institute of Genomics & Integrative Biology in Delhi, National Institute of Biomedical Genomics in Kolkata, and Gujarat Biotechnology Research Centre in Gandhinagar.

How were diverse samples collected?

A median of 159 samples from each non-tribal group and 75 samples from each tribal group chosen were collected from 83 population groups that inhabit over 100 distinct geographical locations to estimate the relatively rare mutations that are important to understand complex diseases. The samples were taken from unrelated individuals to ensure accurate estimation of mutation frequencies across groups. Three to six parent-child pairs were included in each population group to uncover de novo mutations (mutations that occur randomly in a child but not seen in parents).

Genomes of five tribes across India — Tibeto-Burman tribe, Indo-European tribe, Dravidian tribe, Austro-Asiatic tribe, and a continentally admixed outgroup — were sequenced. Genomes of three non-tribes — Tibeto-Burman non-tribe, Indo-European non-tribe, and Dravidian non-tribe — were also sequenced. Since language is an established proxy for genetic diversity in the Indian population, sampling was done to appropriately represent the four large major language families as well — Indo-European, Dravidian, Austro-Asiatic and Tibeto-Burman. However, the four ancient populations living in the Andamans, dating back 65,000 years ago, and two relatively modern populations from about 5,500 years ago, were not included.

What do the preliminary findings reveal?

In total, 180 million mutations have been found from the individuals sequenced; while 130 million variations are in the non-sex chromosomes (22 pairs of autosomes), 50 million mutations are in the sex chromosomes X and Y. It should not be surprising that 180 million mutations were found. The reason: the human genome has three billion base pairs of DNA and the genome of 9,772 individuals were sequenced. Most importantly, the 9,772 individuals belong to 83 distinctly different endogamous groups. Of that, the non-coding regions in the genome, which have DNA sequences that do not directly code for proteins, comprise 98%. A large number of the 180 million variants found in the sequenced genomes of 9,772 individuals are very likely to be present in the non-coding regions.

Polymorphisms or variations in the non-coding regions of the human genome, particularly the mutations that are evolutionarily conserved, will help in tracing evolutionary history. Tracing evolutionary history becomes important as many of the “contemporary Indian populations have originated from a few founding groups and have maintained distinct identities through centuries of endogamy.”

What is the significance of the mutations in endogamous groups?

Endogamy is highly prevalent in all the 83 population groups under study, though the degree varies. As a result of the centuries-long practice of endogamy, population-specific unique variations, including distinct disease-causing mutations with amplified frequencies, are likely to be seen within specific groups. While the global genomic landscape is predominantly Eurocentric, and other genome projects have documented genetic diversity, India, with its prolific and distinct endogamous populations, has been severely underrepresented in these studies. The study is therefore important for having captured the genetic diversity of “one of the highly underrepresented populations in the global genomics landscape”. Genetic mutations found associated with endogamous population-specific diseases will help the government to come up with targeted public health policies.

What are the medical implications?

The 130 million variations identified are expected to spur studies that aim to determine the possible roles of population-specific genetic mutations in various diseases. Understanding genetic variations can pave the way for precision medicine, ensuring treatments and interventions tailor-made for Indian genetic profiles. The data on variants associated with diseases will enable the development of affordable, genomics-based diagnostic tools, facilitating early detection, and prevention and management of diseases in India.



Source link

Share.
error: Content is protected !!
Exit mobile version