I cannot find an ortholog.txt.table file from Treefam. Where is that
Also, TreeFam uses Ensembl. Are there docs on how to the us ID resolver
to map these ids to the RGD ids that RatMine uses?
From: Julie Sullivan [mailto:[hidden email]]
Sent: Tuesday, June 15, 2010 10:14 AM
To: Sierra Moxon; Vallejos, Andrew; Kalpana Karra
Subject: m4m grant II : data loaded so far
I am just filling in my chart for the new grant and I was hoping to add
Do you plan on loading TreeFam any time soon?
Andrew, I know you were having issues with BioGRID's identifiers. Is
going to be resolved?
If you have other work to keep you busy (ha ha), please wait to load TreeFam
until after the next release of InterMine. Currently the TreeFam converter
loads all of the orthologs for ALL of the organisms you specify in your
project.xml, which takes a really long time as there is a lot of data.
Sierra pointed out that she was just interested in Fish <--> Rat and Fish <-->
Fly orthologs, NOT Fly <--> Rat orthologs (also true of interactions). So I am
going to update the converter to take in a `organism` parameter (rat) and a
`homolog organisms` parameter (fish, fly, etc). This will cut down on the
amount of (unwanted) data loaded and speed up the build process. 0.94 should be
ready in a few weeks.
There isn't any documentation yet on ID resolvers. Here is the javadoc:
You want to populate a data structure that consists of taxonId,
primaryIdentifier, and a list of other identifiers. All of our converters
implement the FlyBase ID resolver; let me know if you have questions getting
this to work.
Vallejos, Andrew wrote:
> I cannot find an ortholog.txt.table file from Treefam. Where is that
> Also, TreeFam uses Ensembl. Are there docs on how to the us ID resolver
> to map these ids to the RGD ids that RatMine uses?