Consensus Gene Set Analysis tool
Select input type


[?]
Load Data file
Select type of file

[?]
Load Setup file
[?]

[?]
Load input gene statistics file

[?]

Select Gene set collection type


[?]
Load Gene2Geneset:
[?]
Select organism
[?]
Affymetrix attribute


Gene set size
Lower limit Upper limit genes [?]
Consensus Analysis
 
                                     
 Gene level statisticsGene set analysis methodP-value calculation method No of permutations for P-value calculationAdjustment Method

Email (Optional)
[?]

Run

Input data

To load a set of not normalized CEL files as raw data they should be compressed in a .zip archive with a name without spaces. It is also possible to load already normalized data in a text file, in which the first column should be the probeset IDs and correspond to the ones used in the annotation.

(Back to General Instructions)

Contact

Enter your E-mail Id for getting the result in your mail.

(Back to General Instructions)

Select comparison

Click on 'Upload' in order to see the conditions that are assigned to each microarray. You are required to define one condition that is to be compared,

(Back to General Instructions)

Gene level statistics

The gene-level statistics are simply represented by a one-columned dataframe (or a named vector) of numeric values (usually p-values or t-values from a differential expression analysis) with some kind of gene IDs as rownames.

(Back to General Instructions)

Gene to geneset

The gene set collection should describe the grouping of genes into gene sets.Using a two-column mapping of all gene to gene set association is a simple way to load custom gene set collections. Note that the gene names in the gene set collection have to match the gene names used for the gene-level statistics.

(Back to General Instructions)

Load data

To load a set of CEL files as raw data they should be compressed in a .zip archive with a name without spaces. It is also possible to load already normalized data in a text file, in which the first column should be the probeset IDs and subsequent columns contain the normalized data for each sample. The first line is a header with the sample names (also used in the setup-file). Avoid using sample names starting with numbers. See the example files under General instructions.

(Back to General Instructions)

Load Setup

The setup file should describe the experimental setup assigning each sample to a specific condition. The format of the file should be as follows: The first column should contain the names of the CEL files (or the sample names used in the header of the normalized data file) and additional columns should assign attributes in some category to each array. See the example files under General instructions.

(Back to General Instructions)

Extract filter

The user can get a list of affymetrix attribute for the selected organism from which one should be selected for generating genetogene sets

(Back to General Instructions)

Consensus gene set analysis

Gene set analysis uses gene-level statistics (e.g. from microarray or RNA-seq analysis) together with a gene set collection (e.g. GO-terms) in order to identify gene sets that are significantly enriched by high-scoring genes (generally differentially expressed genes). Here, different methods for calculating gene set statistics can be choosen. By running different methods with the same input a consensus gene set result can be calcualted. This scores the gene sets highest if they are found significant in all the run methods.

(Back to General Instructions)

Gene set collection

Either a custom gene set collection can be loaded from a file or Gene Ontology Terms can be used as gene sets.

(Back to General Instructions)

General Instructions

Click "[?]" to display instructions here, or navigate through the links below.

Tool description
Input instructions
Output description

Example files to download

Normalized data
Raw data
setup file
Gene statistics file
Gene to geneset file
Click Load Example microarray data and gene to geneset file
OR
Click Load Genestatistics and gene to geneset file and hit Run

Input Instructions

Two inputs are required. First, gene-level statistics provide one value per gene (e.g. p-value or t-value), additionally the fold-change can also be provided in order to have the information about the direction of change. Second, a gene set collection is needed that assigns genes into one or several functional groups.

(Back to General Instructions)

Select input type

Either raw microarray CEL-files or pre-normalized microarray data can be loaded and processed in order to generate the gene-level statistics that are required as input to the gene set analysis. Gene-level statistics generated in other ways or from other platforms (RNA-seq etc) can also be loaded from a text-file.

(Back to General Instructions)

Gene set size

The gene sets will be defined only by the genes that there exists data for. The size limits enables the discarding of too small and too large gene sets from the analysis.

(Back to General Instructions)

Select organism

Select the organism for which the dataset has to be fetched from BioMart

(Back to General Instructions)

Output file description

This tool aims for the classification of gene sets from different conditions. A consensus heat map showing the different up and down regulated genes is the output.

(Back to General Instructions)


Online tools powered by piano

Citations
1) Väremo L, Nielsen J, Nookaew I ( 2013) Enriching the gene set analysis of genome-wide data by incorporating directionality of gene expression and combining statistical hypotheses and methods Nucleic Acids Res. Apr;41(8):4378-91
A * indicates that these authors contributed equally to the work.