The amount of data in a microbiome dataset depends on the location of the sample taken from, for example a human subject or environmental location under research. The graph below shows differences in the number of species within a certain site of the human body. Gut and Skin microbiomes contain a higher number of species compared to those within the Respiratory and Urogenital as they are more exposed to many external environmental microbiomes from food, drugs, clothing and skincare product application.
A vast amount of data can be collected when studying the human microbiome and this data can differ depending on both the environment (parts of the body) that the microbiome occupies and the technology used to generate the data. When compared to the human genome, the potential amount of sequence data that could be collected when analysing just the gut microbiome is much greater, as shown in the table below.
To further complicate things, the microbiome data of individuals can also be different depending on a number of other factors , including the following:
- Country the individual lives in
- People around them
- Age of individual
- Their diet
- Personal care routine (skincare, toothpaste etc)
- Personal lifestyle habits e.g. smoking
- Their health
If we consider one example, age of individuals, we can see the effect of ageing on the gut microbiome as shown in the diagram below. The Firmicutes population is lower in an elderly subject compared to a young adult and also the Lentisphera population has virtually disappeared.
|A - Elderly Subjects||B - Younger Adults|
These figures (taken from the paper Composition, variability and temporal stability of the intestinal microbiota of the elderly.) show the differences in microbiota composition at phylum level in (A) 187 samples from elderly subjects and (B) 9 healthy younger adult controls.
What is different about the data?
Microbiomes are comprised of different taxonomic groups, and different compositions of different types of cells within that population. Fundamentally it is the same type of data, but the data formats vary due to the different technological and analytical approaches taken.
For the latest information that may help to understand microbiome data, take a look at the 'Tools/Databases' category of our microbiome knowledge base: https://www.eaglegenomics.com/microbiome-knowledge-base/
Number of genes and Gb of sequence data from: https://www.nature.com/articles/nature08821 and https://bitesizebio.com/8378/how-much-information-is-stored-in-the-human-genome/