Copy number aberrations / Match copy number and expression features

Description

Matches the features of a copy number data set with features of an expression data set using their chromosomal locations. Running this tool is a prerequisite for testing copy-number-induced effects on expression. The copy number data set can be either a microarray (aCGH) or an NGS (CNA-seq) one. For the expression data, only microarrays have been tested, but in the future RNA-seq or miRNA-seq might also be applicable.

Parameters

Details

First samples without matching pair in the other data set are removed from the analysis. Then copy number and expression features are paired with each other using the specified method. The details of the methods are as follows.

Matching occurs on the basis of genomic locations. In case of "distance", the midpoint of copy number and expression features are calculated and for each feature in the expression data set the closest feature of the copy number platform is selected. If method is "overlap", each gene in the expression data set is matched to the feature from the copy number platform with the maximum percentage of overlap. If the maximum percentage of overlap equals zero, the gene is not included in the matched objects. If method is "overlapplus", the features are first matched by their percentage of overlap (as with the "overlap" method). For all non-matched expression features its closest two copy number features (one down- and one upstream) are determined. If the copy number signature of these two copy number features is identical, intrapolation seems reasonable, and and the expression feature is matched to the closest of these two copy number features. Hence, method "overlapplus" makes use of the copy number data, consequently, matching may be different for different data sets.

Output

A table of matched copy number and expression features. This file can be used to test for copy-number induced expression changes ("Test for copy number induced expression changes" tool) and to plot profiles of individual samples ("Plot profiles of matched copy number and expression" tool).

A heatmap showing the two dataset organized by chromosomal position is also generated.

References

Wieringen et al. (2009) Nonparametric testing for DNA copy number induced differential mRNA gene expression. Biometrics 65: 19-29