Omicscope pipeline¶
- omicscope.EnrichmentScope(OmicScope: Omicscope, Analysis: str = 'ORA', dbs: List[str] = ['KEGG_2021_Human'], padjust_cutoff: float = 0.05, organism: str = 'human', background=None) Enrichmentscope[source]¶
EnrichmentScope - Enrichment Analysis
EnrichmentScope is the module designed to perform over-representation and Gene-Set Enrichment Analyses of proteins and genes. In EnrichmentScope, several figures enable user to see enriched terms with their respective proteins.
- Parameters:
OmicScope (Omicscope) – Omicscope object
Analysis (str) – Over-representation Analysis (ORA) or Gene-Set Enrichment Analysis (GSEA). Defaults to ‘ORA’.
dbs (List[str]) – List of enrichment databases to perform the enrichment analysis. Defaults to [‘KEGG_2021_Human’].
padjust_cutoff (float, optional) – P-Adjusted cutoff . Defaults to 0.05.
organism (str, optional) – Organism to perform enrichment analysis. Defaults to ‘human’.
background (int, list, str, bool) – Background genes. By default, all genes listed in the gene_sets input will be used as background. Alternatively, user can use all genes evaluated in study (Recommended, background = True). Still, user can define a specific number (integer) to use as background (Not recommended), such as number of reviewed proteins in the target organism on Uniprot.
- Returns:
Enrichmentscope – Return a EnrichmentScope obj. The results is stored to obj.results.
- omicscope.Nebula(folder: str, palette: str = 'Dark2', pvalue_cutoff: float = 0.05) nebula[source]¶
Nebula - Multiple group comparison
Nebula is the module to integrate all data generated by OmicScope and EnrichmentScope pipelines.
- Parameters:
folder (str) – path to folder that contains all .omics files
palette (str) – Palette to assign colors and discriminate groups
pvalue_cutoff (float) – P-value threshold to consider differentially regulated proteins
- Returns:
Nebula – Return a Nebula obj.
- omicscope.OmicScope(Table: str, Method: str, ControlGroup: str | None = None, ExperimentalDesign: str = 'static', pvalue: str = 'pAdjusted', PValue_cutoff: float = 0.05, normalization_method: str | None = None, imputation_method: str | None = None, FoldChange_cutoff: float = 0.0, logTransform: bool = True, ExcludeContaminants: bool = True, degrees_of_freedom: int = 2, independent_ttest=True, **kwargs) Omicscope[source]¶
OmicScope - Differential Proteomics
OmicScope was designed to be compatible with several Proteomics software, such as Progenesis Qi for Proteomics, PatternLab V, MaxQuant, DIA-NN, ProteomeDiscoverer, and FragPipe.
- Additionally, users can also input data from other Omics sources (e.g.Transcriptomics),
using General and Snapshot methods. In General, users can analyse data in a pre-specified format using excel workbooks. On the other hand, Snapshot enables users to import pre-analyzed data into OmicScope quickly.
OmicScope are able to perform differential proteomics analysis, returning p-value, adjusted p-value (Benjamini-Hochberg approach), and fold-changes.
- Parameters:
Table (str) – Quantitative data.
Method (str) – Method used to import data.
ControlGroup (Optional[str], optional) – Control group. Defaults to None.
ExperimentalDesign (str, optional) – Experimental design to perform statistical analysis. Options: ‘static’, ‘longitudinal’. Defaults to ‘static’.
pvalue (str, optional) – Statistical parameter to consider entities differentially regulated. Options: ‘pvalue’, ‘pAdjusted’, ‘pTukey’. Defaults to ‘pAdjusted’.
PValue_cutoff (float, optional) – Statistical cutoff. Defaults to 0.05.
normalization_method (str, optional) – Data normalization can be performed. Options: Options: ‘average’, ‘median’, ‘quantile’. Defaults to None.
imputation_method (str, optional) – Impute values to data instead of NaN. Options: “median”, “mean”, “knn”. Defaults to None.
FoldChange_cutoff (float, optional) – Absolute fold-change cutoff. Defaults to 0.0.
logTransform (bool, optional) – Log-transform protein abundances. Defaults to False.
ExcludeContaminants (bool, optional) – Exclude the list of Contaminant proteins. Defaults to True.
degrees_of_freedom (int, optional) – Degrees of freedom to run longitudinal analysis. Defaults to 2.
independent_ttest (bool, optional) – while running a t-test, the user can specify if data sampling is independent (default) or paired (independent_ttest=False). Defaults to True.
- Returns:
OmicScope – Return a OmicScope obj. The quantitation data is stored to obj.quant_data.