Chapter 9: Data analysis in metabolomics: from information to knowledge

by Jacopo Troisi, Giovanni Troisi, Giovanni Scala, Sean M. Richards 

Abstract

Data analysis is a pivotal step in the metabolomics investigation pipeline as it allows to extract knowledge from datasets and get new insights and new perspectives as well as generate new hypotheses about biological complex processes. Data analysis uses various strategies including exploratory analysis which tends to evaluate metabolites individually and machine learning algorithms which in turn can be unsupervised (data driven) or supervised (task driven) allowing the study of the interplay of the several investigated metabolites. Among the first, particularly important in metabolomics are the clustering algorithms and the principal components analysis. Among the latter, a prominent role is played by partial least square discriminant analysis, artificial neural networks, supported vectors machines. These algorithms as well as solo, can be combined in ensemble models. Particular emphasis is placed in this chapter on strategies for evaluating the effectiveness of machine learning algorithms trainings, including validation strategies, features selection mechanisms and hyperparameter optimization.

Jacopo Troisi

Department of Medicine, Surgery and Dentistry “Scuola Medica Salernitana” University; Theoreo srl – spin off company of the University of Salerno, Italy; Department of Chemistry and Biology “A. Zambelli”, University of Salerno, Italy 


Jacopo Troisi is visiting professor of Metabolomics in both Salerno University and University of Sannio in Italy. He is a Research Scientist and COO at the European Biomedical Research Institute of Salerno (EBRIS), co-founded by the Harvard University and the Salerno Medical School. He is also co-founder and CEO at Theoreo srl, a spin-off company of the University of Salerno. Prof. Troisi research is related to metabolomics. In particular, he studies the metabolomic fingerprint of several human diseases to develop diagnostic tools. He has received several international patents for diagnostic tests based on metabolomic profiling. Dr. Troisi has published widely in such peer reviewed journals as Science, Nature Microbiology, Nature Communications, Frontiers in Immunology, Nutrients, BMC Microbiology, Metabolomics, and Scientific Reports.

Contact

Giovanni Troisi

Theoreo srl – spin off company of the University of Salerno, Italy

Giovanni Troisi is an informatic engineer, technical manager of research and development sector for impedentiometry applied to the human body in a big USA company. He supports one of the major Italian government bodies in the Salerno area for the processing of personal, financial and health data of citizens. Finally, he is fully involved in the field of artificial intelligence for the early recognition of new diseases.
Combining his studies in computer engineering with a passion for metabolomics and cellular nutrition, he has developed the technology that has collected and analyzed a huge amount of data coming from University, Hospital and Specialistic diagnostic center tests.
When he isn’t glued to a computer screen, he loves to play sports, enjoy long nature walks with his baby in pouch and trying very hard not to be the worst Russian language pupil in his living area.
He is currently committed to making wellness and health technology personal and within everyone's reach.

Giovanni Scala

Theoreo srl – spin off company of the University of Salerno, Italy

Chemist, CTO and cofounder of Theoreo. He deals with the commercial and marketing area. His experience is in the field of advanced analytical technologies, in particular mass spectrometry and chromatographic  separation.  He  coordinates  several  R&D  projects  granted  by  the  European  Union  and managed by Campania Region. These projects deal with mass spectrometry and chromatography applied to various  fields:  metabolomics, diagnostics,  food  farming,  pollution.  From  2006  to  2010  he  collaborated  with  the  GCSAR (General  Commission  for  Scientific  Agriculture  Research,  Ministy  of  Agriculture)  in  Damasco,  in  a  project dealing with oil and olive production. His current research is related to metabolomics. In particular, he studies the metabolomic fingerprint of several human diseases to develop diagnostic tools. He has received several international patents for diagnostic tests based on metabolomic profiling.

Sean M. Richards

Department of Biological and Environmental Sciences, University of Tennessee-Chattanooga, Chattanooga, TN 37403 - Department of Biological and Environmental Sciences, University of Tennessee-Chattanooga, Chattanooga, TN 37403, USA; Department of Obstetrics and Gynecology  University of Tennessee Health Science Center College of Medicine Chattanooga, TN 37403 

Over the last 25 years Dr. Richards has conducted research on the exposure and effects of multiple toxicants on organisms ranging from bacteria to humans. These toxicants include herbicides, insecticides, pharmaceuticals, Bisphenol A, Poly Aromatic Hydrocarbons, Per- and Polyfluoroalkyl Substances, metals, and Poly Chlorinated Biphenyls. Dr. Richards also has experience with the laws and regulations which govern exposure to toxicants through air, soil, and water. 
Most recently, Dr. Richards has focused on metabolomics.  Specifically,  the alterations of metabolomes in response to toxicants or in populations of humans affected by endometrial cancer, congenital malformations, endometriosis, polycystic ovarian syndrome, and fetal central nervous system anomalies.  The goal of this research is to determine specific biochemical pathways associated with the affected populations. Ultimately, understanding the changes in the metabolome as a result of disease or intoxication can lead to early detection and personalized treatments.
Contact