|
Modeling Dependencies in Protein-DNA Binding Sites:
Analysis of Yeast Clusters |
|
Another rich collection of datasets of genes were collected by
Hughes et al, 2000.
These clusters of genes are based on functional annotations, and were originally analyzed using
AlignACE. This analysis included multiple runs of AlignACE, followed by
filtering based on the quality of the motifs found. The best PSSMs were
reported for each cluster.
To gauge the quality of our baseline method, we compared the PSSMs learned by our procedures to the ones learned and reported by Hughes etal. For this task we used the whole training data (as done by AlignACE), and examined the two learned motifs for each group by comparing their sensitivity, specificity and their hypergeometric p-value. In addition, we present the runs of AlignACE, using the default parameters. All results, including raw information, are shown in the following table. |
|
# |
NAME |
PSSM |
Huges et. al |
Default AlignACE |
|
1 |
aminoacid_biosynthesis |
|||
|
2 |
aminoacid_metabolism |
|||
|
3 |
assembly_of_protein_complexes |
|||
|
4 |
biogenesis_of_cell_wall |
|||
|
5 |
budding_cell_polarity_and_filament_formation |
|||
|
6 |
carbohydrate_utilization |
|||
|
7 |
cell_cycle_control_and_mitosis |
|||
|
8 |
cell_growth |
|||
|
9 |
cellular_import |
|||
|
10 |
cytoplasmic_degradation |
|||
|
11 |
detoxificaton |
# |
NAME |
PSSM |
Huges et. al |
Default AlignACE |
|
12 |
dna_synthesis_and_replication |
|||
|
13 |
glucose_metabolism |
|||
|
14 |
homeostasis_of_other_ions |
|||
|
15 |
lipid_fattyacid_and_sterol_biosynthesis |
|||
|
16 |
meiosis |
|||
|
17 |
metabolism_of_vitamins_cofactors_and_prosthetic_groups |
|||
|
18 |
mitochondrial_organization |
|||
|
19 |
mitochondrial_transport |
|||
|
20 |
nuclear_organization |
|||
|
21 |
organization_of_cytoplasm |
|||
|
22 |
organization_of_cytoskeleton |
# |
NAME |
PSSM |
Huges et. al |
Default AlignACE |
|
23 |
organization_of_endoplasmatic_reticulum |
|||
|
24 |
organization_of_golgi |
|||
|
25 |
organization_of_plasma_membrane |
|||
|
26 |
other_transcription_activities |
|||
|
27 |
other_transport_facilitators |
|||
|
28 |
pheromone_response_matingtype_determination_sexspecific_proteins |
|||
|
29 |
proteases |
|||
|
30 |
protein_kinase |
|||
|
31 |
protein_targeting_sorting_and_translocation |
|||
|
32 |
recombination_and_dna_repair |
|||
|
33 |
regulation_of_carbohydrate_utilization |
# |
NAME |
PSSM |
Huges et. al |
Default AlignACE |
|
34 |
respiration |
|||
|
35 |
ribosomal_proteins |
|||
|
36 |
sporulation_and_germination |
|||
|
37 |
stress_response |
|||
|
38 |
transcriptional_control |
|||
|
39 |
transcription_factors |
|||
|
40 |
translation_initiation_elongation_and_termination |
|||
|
41 |
vesicular_transport_golgi_network_etc |