NCBI BioSample (INSDC), GSA BioSample and Viral Genbank.
Increase in exploitable coordinates extracted from metadata.
Host Metagenomes were harmonized from metadata and associated to organism-based variables.
ENVO terms, IUCN Ecosystem and more coordinate-based variables.
Our Dataset is made of multiple tables:
Our data lake architecture allows for flexible and evolutive tables, without the strict rules of structured SQL-like databases. It relies on PRABI-owned infrastructure.