Mapping the location of recurring amplicons in array-CGH data

Lingjærde, OC; Liestøl, K; Baumbusch, L; Størvold, HL; Børresen-Dale, AL

doi:10.1186/bcr1169

Volume 7 Supplement 2

The Third International Symposium on the Molecular Biology of Breast Cancer

Poster Presentation
Published: 17 June 2005

Breast Cancer Research volume 7, Article number: P4.39 (2005)

Background

Copy number alterations (CNAs) are believed to constitute key genetic alterations in the cellular transformation of many tumors [1]. Microarray-based comparative genomic hybridization (array-CGH) allows the construction of high-resolution, genome-wide maps of copy number alterations. Statistical software packages are available for exploring and analysing array-CGH data (see, for example, [2, 3]); they help delineate the boundaries of CNAs in individual tumors and thereby localize and identify potential oncogenes and tumor suppressor genes. Although CNAs vary widely in size and location, some genomic regions are known to be altered much more frequently than others. Mapping these CNA hotspots helps locate genes of potential importance to tumor development and identify alterations that form key steps in that development. There is, however, a need for consistent ways of combining array-CGH results across arrays. Here, we present an approach to this problem based on statistical modelling.

Methods

Suppose that for each gene (clone) on an array we have a binary (0/1) variable indicating whether the gene is amplified or not. Such data may be constructed from array-CGH data using one of the aforementioned software packages. Each tumor may then be represented by an m-dimensional binary vector, where m is the number of genes on the array. For an experiment involving n tumors we thus have a set of m-dimensional vectors z₁, ..., zₙ, which we regard as realizations from a multivariate distribution P(z). We consider three models for P(z) of increasing sophistication. The first assumes complete independence between genes, the second assumes a Markov-chain dependence structure and the third assumes a Markov Random Field dependence structure [4]. We demonstrate how P(z) can be estimated in each case and show that, by suitable constrained maximization of P(z), we may determine genomic intervals corresponding to probable intervals of copy number alteration.
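
As an illustration of the first two models, the following minimal sketch estimates P(z) from an n × m binary matrix under the independence model and under a first-order Markov chain. The abstract does not state the estimators used, so everything here is an assumption made for illustration: the function names, the additive smoothing constant alpha, and the choice of a homogeneous chain with a single 2 × 2 transition matrix shared by all positions.

```python
import numpy as np

def fit_independence(Z, alpha=0.5):
    """Per-gene amplification probabilities under the independence model.

    Z is an (n, m) 0/1 array (tumors x genes). alpha is an additive
    smoothing constant (an assumption) that keeps estimates off 0 and 1.
    """
    Z = np.asarray(Z, dtype=int)
    n = Z.shape[0]
    return (Z.sum(axis=0) + alpha) / (n + 2 * alpha)

def loglik_independence(z, p):
    """log P(z) when genes are treated as independent Bernoulli variables."""
    z = np.asarray(z, dtype=int)
    return float(np.sum(z * np.log(p) + (1 - z) * np.log(1 - p)))

def fit_markov_chain(Z, alpha=0.5):
    """Homogeneous first-order chain along the genome (an assumption).

    Returns the probability that the first gene is amplified and a 2x2
    transition matrix T with T[a, b] = P(next status = b | status = a).
    """
    Z = np.asarray(Z, dtype=int)
    n = Z.shape[0]
    p0 = (Z[:, 0].sum() + alpha) / (n + 2 * alpha)
    prev, nxt = Z[:, :-1].ravel(), Z[:, 1:].ravel()
    counts = np.full((2, 2), alpha, dtype=float)
    np.add.at(counts, (prev, nxt), 1.0)  # tally neighboring status pairs
    T = counts / counts.sum(axis=1, keepdims=True)
    return p0, T

def loglik_markov_chain(z, p0, T):
    """log P(z) under the fitted chain."""
    z = np.asarray(z, dtype=int)
    ll = np.log(p0 if z[0] == 1 else 1 - p0)
    ll += np.sum(np.log(T[z[:-1], z[1:]]))
    return float(ll)
```

Under the chain model, a status vector with one contiguous amplified block typically scores higher than a vector with the same number of amplified genes scattered along the genome, which is the kind of neighbor dependence the second and third models are meant to capture. The Markov Random Field model is not sketched here.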

Results

The method is demonstrated (for all three models) on simulated binary copy number status data for varying numbers of genes and tumors. We also demonstrate its use on real array-CGH data that have been processed by CGH-Explorer [2] to obtain a binary copy number status vector for each tumor.
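
The abstract does not describe how the simulated data were generated. Purely as a hypothetical sketch of what binary copy number status data with one recurring amplicon might look like, the following generates an n × m matrix in which a fixed interval is amplified in a fraction of tumors and every other entry is amplified independently with a small noise probability. All names and parameter values (simulate_status, p_in, p_noise, seed) are assumptions, not the authors' simulation design.

```python
import numpy as np

def simulate_status(n_tumors, n_genes, interval, p_in=0.6, p_noise=0.05, seed=0):
    """Simulate a 0/1 copy number status matrix (tumors x genes).

    interval: (start, end) gene indices, inclusive, of the recurring amplicon.
    p_in:     fraction of tumors carrying the amplicon (hypothetical value).
    p_noise:  probability of a spurious amplification call per gene.
    """
    rng = np.random.default_rng(seed)
    start, end = interval
    # Background noise: isolated false-positive calls.
    Z = (rng.random((n_tumors, n_genes)) < p_noise).astype(int)
    # Tumors carrying the amplicon have the whole interval set to 1.
    carriers = rng.random(n_tumors) < p_in
    Z[carriers, start:end + 1] = 1
    return Z
```

Varying n_tumors and n_genes in such a generator gives a simple way to examine how well a recurring interval can be recovered as the amount of data changes.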

Conclusion

We have proposed a novel statistical method for deriving probable intervals of CNA from copy number status data for a sample of tumors. The method is based on a probabilistic model for the copy number status in a tumor, and we have discussed three models of increasing sophistication. The most basic of the three models corresponds to simply reporting all genes that are amplified in at least k% of the tumors. The other two models take into account the important fact that neighboring genes are not, in general, altered independently of each other. Exploiting this property of copy number data yields probable intervals of CNA that are less prone to noise degradation than those obtained with alternative methods. In addition, the results are derived within a well-defined probabilistic framework and are therefore more easily interpretable.
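
The rule that the most basic model reduces to, reporting all genes amplified in at least k% of the tumors, can be sketched as follows. Grouping the reported genes into contiguous index intervals is an added convenience assumed here for illustration, and the function names are hypothetical.

```python
import numpy as np

def recurrent_genes(Z, k):
    """Boolean mask of genes amplified in at least k percent of tumors.

    Z is an (n, m) 0/1 array (tumors x genes); k is a percentage in [0, 100].
    """
    Z = np.asarray(Z, dtype=int)
    freq = 100.0 * Z.mean(axis=0)  # percent of tumors amplified, per gene
    return freq >= k

def runs_of_true(mask):
    """Return (start, end) index pairs, inclusive, of maximal runs of True."""
    intervals, start = [], None
    for j, flag in enumerate(mask):
        if flag and start is None:
            start = j                       # a new run begins
        elif not flag and start is not None:
            intervals.append((start, j - 1))  # the run ended at j - 1
            start = None
    if start is not None:                   # run extends to the last gene
        intervals.append((start, len(mask) - 1))
    return intervals
```

Because each gene is judged on its own, a single noisy call can split or extend an interval under this rule, which is the weakness that the models with neighbor dependence are intended to reduce.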

References

1. Lengauer C, Kinzler KW, Vogelstein B: Genetic instabilities in human cancers. Nature. 1998, 396: 643-649. 10.1038/25292.
2. Lingjærde OC, Baumbusch LO, Liestøl K, Glad IK, Børresen-Dale AL: CGH-Explorer: a program for analysis of array-CGH data. Bioinformatics. 2005, 21: 821-822.
3. Wang P, Kim Y, Pollack J, Narasimhan B, Tibshirani R: A method for calling gains and losses in array CGH data. Biostatistics. 2005, 6: 45-58. 10.1093/biostatistics/kxh017.
4. Cressie NAC: Statistics for Spatial Data. 1993, New York: John Wiley & Sons.

Author information

Authors and Affiliations

Bioinformatics Group, Department of Informatics, University of Oslo, Norway
OC Lingjærde, K Liestøl & HL Størvold
Department of Genetics, Institute for Cancer Research, The Norwegian Radium Hospital, Oslo, Norway
L Baumbusch & AL Børresen-Dale

About this article

Cite this article

Lingjærde, O., Liestøl, K., Baumbusch, L. et al. Mapping the location of recurring amplicons in array-CGH data. Breast Cancer Res 7 (Suppl 2), P4.39 (2005). https://doi.org/10.1186/bcr1169
