Feature #516
openAdd conservation track from ucsc
0%
Description
Blosum100 is a simple measure of conservation, but is a better measure of how disruptive a particular amino acid change will be. We should also have the phyloP score, downloaded from ucsc, for the base in consideration, where available.
The phyloP score is the -log(p) measure of predicted conservation for the specific base (as opposed to phastCons which is a measure of conservation of the base in the context of flanking bases). Note that the measure also includes negative values, which indicate the base in question is predicted to be fast-evolving.
I've downloaded the broadest track, vertebrates (there are also mammal and primates), and written a very simple program (lookup.c) to pull out values given chr# and location. Interestingly, the lookup values agree qualitatively with the ucsc genome browser values, but they are not an exact match - perhaps the phyloP switches used for the ucsc ftp files were slightly different from switches used for the data in the genome browser. It might be worth re-running some of the data.
All can be found on:
snp-dev:/nomirror/krobasky/conservation
Closing this ticket should include folding these data into the GET INSTALL scripts.
Reference:
[2009, Pollard, Hubisz, Rosenbloom, et al, Gen Res]
http://genome.cshlp.org/content/20/1/110.full.pdf
Updated by Madeleine Ball over 14 years ago
Replying to [comment:1 https://www.google.com/accounts/o8/id?id=AItOawmjAqCfwHP2SRB4CAwzldFZzlop-n5GiCo]:
Is this a trac bug? Owner is krobasky (me)
Updated by Ward Vandewege almost 14 years ago
- Project changed from 19 to GET-Evidence
- Category deleted (
GET-Evidence)
Updated by Madeleine Ball almost 14 years ago
- Assigned To deleted (
Jason Bobe)
Assignment to Jason Bobe was an error in the import process. It was assigned to Kim, but since that was a long time ago I'm going to reassign it to nobody.