alon1999_set

All data were taken from the article: U. Alon, N. Barkai, D. A. Notterman, K. Gish, S. Ybarra, D. Mack, and A. J. Levine, "Broad patterns of gene expression revealed by clustering of tumor and normal colon tissues probed by oligonucleotide arrays", Proc. Natl. Acad. Sci. USA, 96 (12), 6745-6750 (1999) [MEDLINE abstract; PNAS full text].

The data contain the measured expression levels of 2000 human cDNAs and ESTs (including sequences homologous to some known eukaryotic genes) in colon adenocarcinomas from several patients. For some patients, the expression of these RNAs was also measured in normal colon tissue. In total, the table contains expression measurements for 40 tumor and 22 normal colon tissues. These samples are combined into two groups: "Tumor" and "Normal".

The original data were taken (November 2002) from the Microarray Databases site, from the page "Data pertaining to the article 'Broad patterns of gene expression revealed by clustering of tumor and normal colon tissues probed by oligonucleotide arrays'". Expression data were taken from the I2000.html file displayed on that page. The order and numbering of the tissues correspond to the data in the tissues.html file with the '-' symbol. Names of normal tissues begin with the letter "N". Sequence identifiers were taken from the names.html file, also displayed on that page (the order of IDs corresponds to that in the I2000.html file). When the SELTAG data were formed, columns 1 and 2 of the file were interchanged. The resulting SELTAG-format data file (alon1999_set.txt) was formed by directly combining the sequence descriptors (fields F1-F6) with the expression data (fields F7-F68).
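The assembly step described above can be illustrated with a minimal Python sketch. It is not the procedure actually used to build alon1999_set.txt. It assumes that the interchanged columns 1 and 2 are the first two descriptor fields, that each record is simply the 6 descriptor fields followed by the 62 expression values, and that any tissue name not starting with "N" belongs to the "Tumor" group. The function names and the sample values are hypothetical.

```python
# Assumed layout (from the description above): fields F1-F6 are sequence
# descriptors, fields F7-F68 are expression values for 62 tissues.
N_DESCRIPTOR_FIELDS = 6
N_TISSUES = 62


def swap_first_two(fields):
    """Return a copy of the fields with columns 1 and 2 interchanged."""
    if len(fields) < 2:
        raise ValueError("need at least two columns to swap")
    return [fields[1], fields[0]] + list(fields[2:])


def build_record(descriptor, expression):
    """Combine one gene's descriptor fields with its expression values.

    Assumption: the column swap applies to the descriptor fields.
    """
    if len(descriptor) != N_DESCRIPTOR_FIELDS:
        raise ValueError("expected 6 descriptor fields")
    if len(expression) != N_TISSUES:
        raise ValueError("expected 62 expression values")
    return swap_first_two(descriptor) + [str(v) for v in expression]


def tissue_group(name):
    """Assign a tissue to a group: names starting with 'N' are normal."""
    return "Normal" if name.startswith("N") else "Tumor"


# Hypothetical values, used only to show the shape of one record.
descriptor = ["d1", "d2", "d3", "d4", "d5", "d6"]
expression = [float(i) for i in range(N_TISSUES)]
record = build_record(descriptor, expression)
```

With these inputs, `record` has 68 fields, which matches the F1-F68 range given in the description.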

The SELTAG software was used for hierarchical clustering of the experiments from the "alon1999_set" by means of various agglomerative approaches. The results, presented in the alon1999_set.doc file, are in accordance with those in the article by Alon et al. (1999).
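For readers unfamiliar with agglomerative clustering, the general idea can be sketched in a few lines of Python. This is not SELTAG's implementation. It assumes Euclidean distance and average linkage, which is only one of the possible agglomerative approaches, and it runs on four made-up expression profiles rather than on the real data.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two expression profiles."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def average_linkage(c1, c2, profiles):
    """Mean pairwise distance between the members of two clusters."""
    total = sum(euclidean(profiles[i], profiles[j]) for i in c1 for j in c2)
    return total / (len(c1) * len(c2))


def agglomerate(profiles, n_clusters):
    """Repeatedly merge the two closest clusters until n_clusters remain."""
    if not 1 <= n_clusters <= len(profiles):
        raise ValueError("n_clusters must be between 1 and the sample count")
    # Start with every sample in its own cluster.
    clusters = [[name] for name in sorted(profiles)]
    while len(clusters) > n_clusters:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = average_linkage(clusters[i], clusters[j], profiles)
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        merged = sorted(clusters[i] + clusters[j])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return sorted(clusters)


# Made-up profiles (three genes per sample), for illustration only.
toy_profiles = {
    "T1": [9.0, 8.5, 1.0],
    "T2": [8.8, 9.1, 1.2],
    "N1": [2.0, 1.5, 7.0],
    "N2": [2.2, 1.0, 7.5],
}
groups = agglomerate(toy_profiles, 2)
```

On this toy input the two resulting clusters separate the "N" samples from the "T" samples. Real expression data are far noisier, so such a clean split should not be expected in general.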

Complete list of files for the "alon1999_set" data available on the Softberry server:

File                            Size        Description
alon1999_set.txt                1,471 kb    Data in SELTAG format
alon1999_set.descr              191 kb      UniGene links for the genes (sorted by GenBank AC, field F1 "SequenceId" in alon1999_set.txt)
alon1999_set.seq                2,999 kb    Gene sequences in FASTA format
alon1999_set_example.pdf        523 kb      Example of data analysis by means of SELTAG (pdf)
alon1999_set_example.html       31 kb       Example of data analysis by means of SELTAG (html)
alon1999_set_I2000_ori.txt      1,806 kb    Original "I2000" file with gene expression data
alon1999_set_names_ori.txt      531 kb      Original "names" file with sequence descriptors
alon1999_set_tissues_ori.txt    1 kb        Original "tissues" file with tissue labels
alon1999_set_descr_ori.txt      3 kb        Description of the original data
alon1999_set_paper.pdf          732 kb      Original article by Alon et al. (1999)
