News and Blog

The e[catalog] enables both the build up of a single resource of legacy and current research datasets from a variety of experiments, and permits the federation of relevant disparate data sources. This resource can then be shared and viewed globally by all individuals across different departments of an organisation, from technical staff right through to senior management. 


The e[catalog] also allows for seamless collaboration both within the same organisation across various sites and with other external partner organisations.

Being aware of the many different projects and associated experimental datasets resulting from a variety of experimental approaches within an organisation can be of economical benefit, as this can ensure that a repeat of existing experiments on the same samples is prevented. Also researchers within the organisation will be aware of what subjects, samples and raw datasets have already been used in experiments that could potentially be available to be used for a new experiment, thereby saving time.

Datasets within e[catalog] are also readily accessible to undergo various analyses by a Bioinformatician or Statistician who can also reanalyse datasets independently when newer analytical tools and algorithms become available. Having the datasets accessible under a single resource will also permit comparisons and/or combining of similar datasets on which further relevant computational analytics via e[hive] (linked to via the e[catalog] resource) can be performed to provide new biological insights. Hence, addressing the current competency of the numerous computational analysis tools can therefore ensure that the best ones are streamlined and selected for activation within the e[hive] pipeline.

The simple click of the "Analyses" button within each Study of the e[catalog] resource will provide authority to the scientists themselves to seamlessly perform meaningful computational analyses on their own datasets, without the need to rely on other people or external analytical service providers. 

Big data technology data analysis datasets integration research ecatalog data catalog

Yasmin Alam-Faruque

About Yasmin Alam-Faruque

Biocurator, Yasmin Alam-Faruque is a member of Eagle Genomics' Biocuration team, joining in early 2014. "Why do I enjoy data curation at Eagle? It gives me the opportunity to find out about new industries, their areas of research, investigate and organise new datasets and work with the biomedical scientists who create and submit the data to make the data more accessible." Yasmin came to biocuration from a start as a bench scientist, and brings an understanding of biomedical science from an academic perspective, with an MSc in immunology comparing the immunological mechanism involved in corneal and skin graft rejection, a PhD in differential gene expression in mucosal cancers and postdoctoral experience in autoimmune skin disease. In her previous role as a scientific database curator at the European Bioinformatics Institute (EMBL-EBI), she worked on the Renal Gene Ontology Annotation Initiative, a project funded by the charity Kidney Research UK, to produce a resource that can be utilised in the interpretation of data from small- and large-scale experiments investigating molecular mechanisms of kidney function and development, providing new biological insights and thereby help towards alleviating renal disease. She also worked on the curation of various proteins, across species, in the UniProt Knowledgebase, including contributions to the Gene Ontology Annotation and the IntAct protein-protein interaction databases.