Metab8D: A metabolic regulome network from oct-omics and machine learning

Introduction

Metabolite variation across cells may arise from differences in regulation at the transcriptional, post-transcriptional, translational, and post-translational levels. Metab8D uses eight omics classes (genomics (CNV and mutations), histone PTMs, DNA methylation, transcriptomics, RNA splicing, miRNA, lncRNA, proteomics, and phosphoproteomics) to predict metabolomic variation across cancer cell lines from the Cancer Cell Line Encyclopedia (CCLE), and from those predictions infers a multiomic metabolic regulatory network. This repository contains the machine learning workflow that Metab8D uses to assess the relationship between features and the metabolites they predict. The primary output is the Metab8D_network.xlsx file, which lists the top 20 features for predicting each metabolite across 9 omics datasets.

Methods

As outlined in ML_function.ipynb, Metab8D network generation involves training ML models, assessing their accuracy, and extracting feature importances from the trained models. This process is repeated for each omics input. An example of model generation using the histone PTM data is provided at the bottom of ML_function.ipynb. The requirements.txt file lists the packages needed to run this code.
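The train, assess, and rank steps described above can be sketched as follows. This is a minimal illustration and not the code in ML_function.ipynb: the function name `fit_metabolite_model`, the 5-fold cross-validation, the hyperparameters, and the synthetic data are all assumptions made for the example. It uses scikit-learn's `RandomForestRegressor` and measures accuracy as the Pearson correlation between held-out predictions and measured values.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import KFold, cross_val_predict


def fit_metabolite_model(X, y, feature_names, top_n=20, seed=0):
    """Train one random forest for one metabolite and rank its features.

    Hypothetical helper for illustration; not taken from the repository.
    """
    model = RandomForestRegressor(n_estimators=100, random_state=seed)

    # Out-of-fold predictions, so accuracy is measured on held-out samples.
    cv = KFold(n_splits=5, shuffle=True, random_state=seed)
    y_pred = cross_val_predict(model, X, y, cv=cv)
    pearson_r = float(np.corrcoef(y, y_pred)[0, 1])

    # Refit on all samples to obtain impurity-based feature importances.
    model.fit(X, y)
    importances = model.feature_importances_
    order = np.argsort(importances)[::-1][:top_n]
    top_features = [(feature_names[i], float(importances[i])) for i in order]
    return pearson_r, top_features


# Synthetic stand-in data: 80 "cell lines" x 30 "features" (not CCLE data).
rng = np.random.default_rng(0)
X = rng.normal(size=(80, 30))
y = 2.0 * X[:, 0] - 1.5 * X[:, 3] + rng.normal(scale=0.1, size=80)
names = [f"feature_{i}" for i in range(30)]

pearson_r, top_features = fit_metabolite_model(X, y, names)
print(f"Pearson r = {pearson_r:.2f}")
print("Top 3 features:", top_features[:3])
```

In the actual workflow, `X` would be one omics matrix and `y` one metabolite column, with the call repeated for every metabolite and every omics input.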

Results

The proposed regulome network is in the Metab8D_network.xlsx file, which lists the top 20 features for each of the 2,025 trained metabolite models along with their confidence scores. Features with high confidence scores and no previously identified mechanistic relationship to the metabolite should be prioritized for further study.

Conclusions

By comparing accuracies and assessing feature importance scores from multiomic inputs, Metab8D proposes systems-level relationships between omics features and metabolites across the CCLE cell line panel, and validates such relationships in independent data.

File descriptions

Metab8D_network.xlsx: Top 20 features for 225 RF metabolite models from each of 9 omics inputs (2,025 models in total), along with confidence scores (0 through 8) giving the number of controls (out of 8 experiments) in which each feature appeared among the top 20 most important features.
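One way to read the confidence score is as a count of how many top-20 lists contain a given feature. The sketch below assumes that each control experiment yields its own ranked top-20 list; the README does not describe how the controls are constructed, so the function name `confidence_scores`, the toy feature names, and the use of three lists instead of eight are illustrative assumptions only.

```python
from collections import Counter


def confidence_scores(top_lists):
    """Count, for each feature, how many top-N lists contain it.

    Hypothetical helper for illustration; not taken from the repository.
    """
    counts = Counter()
    for features in top_lists:
        # set() ensures a feature is counted at most once per list.
        counts.update(set(features))
    return dict(counts)


# Toy example: 3 short lists with made-up feature names.
runs = [
    ["geneA", "geneB", "geneC"],
    ["geneA", "geneC", "geneD"],
    ["geneA", "geneE", "geneB"],
]
scores = confidence_scores(runs)
print(scores)
```

With eight lists of 20 features each, the same count is bounded above by 8, matching the maximum score reported in the spreadsheet.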

ML_function.ipynb: ML script for random forests, XGBoost, ridge regression, and lasso regression, as well as feature importance generation. Includes example code for using histone PTM data as input.

RF_results: Pearson correlations and P values for all metabolite models from each of 8 omics classes. Significance is determined by the Bonferroni-corrected P value.
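For readers unfamiliar with the correction, a Bonferroni test compares each P value with the significance level divided by the number of tests. The sketch below is a generic illustration: the significance level of 0.05, the function name `bonferroni_significant`, and the P values are assumptions, and the README does not state how many tests are pooled for the correction in RF_results.

```python
def bonferroni_significant(p_values, alpha=0.05):
    """Return one boolean per test: True where p < alpha / number_of_tests.

    Hypothetical helper for illustration; not taken from the repository.
    """
    if not p_values:
        return []
    threshold = alpha / len(p_values)
    return [p < threshold for p in p_values]


# Made-up P values for four metabolite models.
p_values = [0.0001, 0.02, 0.012, 0.5]
flags = bonferroni_significant(p_values)
print(flags)
```

With four tests the per-test threshold is 0.05 / 4 = 0.0125, so only the first and third models pass.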

recon_mapping: MATLAB and Python scripts for extracting genes from reactions involving metabolites of interest and matching them with top feature lists, as well as a csv containing common gene name to BiggID translations.

human_1_mapping.csv: List of metabolites from Recon3D with mapped Human1 IDs.

example_datasets: Metabolomics and histone PTM (original and zero-imputed) data that can be used to test the machine learning code in this repository. The rest of the preprocessed CCLE data and the trained models may be found here: https://doi.org/10.7303/syn68236153

preprocessing_example.ipynb: Example code for preprocessing the original histone PTM data along with examples of z-score normalization and KNN imputation.
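The two preprocessing steps named above can be sketched as follows. This is an assumption-laden illustration, not the notebook's code: the helper `zscore_columns`, the order of operations (normalize, then impute), the choice of 2 neighbors, and the toy matrix are all made up for the example. Imputation uses scikit-learn's `KNNImputer`.

```python
import numpy as np
from sklearn.impute import KNNImputer


def zscore_columns(X):
    """Z-score each column, ignoring missing values (NaN).

    Hypothetical helper for illustration; not taken from the repository.
    """
    mean = np.nanmean(X, axis=0)
    std = np.nanstd(X, axis=0)
    std[std == 0] = 1.0  # avoid division by zero for constant columns
    return (X - mean) / std


# Toy matrix: 4 samples x 3 features with two missing entries.
X = np.array([
    [1.0, 10.0, 0.5],
    [2.0, np.nan, 0.7],
    [3.0, 30.0, np.nan],
    [4.0, 40.0, 1.1],
])

X_z = zscore_columns(X)

# Fill each missing value from the 2 most similar samples.
X_imputed = KNNImputer(n_neighbors=2).fit_transform(X_z)
print(X_imputed)
```

Normalizing before imputation keeps features on a comparable scale when `KNNImputer` computes distances between samples; whether the notebook applies the steps in this order is not stated here.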

sriram-lab/Metab8D
