updating evidence based gff with IPRScan results

classic Classic list List threaded Threaded
18 messages Options
Reply | Threaded
Open this post in threaded view
|

updating evidence based gff with IPRScan results

claudia

Hi Carson,

  A little while back you provided me with a script to edit my gff files in order to update them with IPRscan resutls. It seems I am able to do so when only maker models are present in the gff file, however, when I try to update an anlysis/evidence based gff file I recieve the errors below. I have attached the files for you to look at, any help is appreciated. (  i attached all sets of files, those that worked, 'models', and those that did not, 'abinit')

Thanks in advance,
Claudia


ERROR ( is associated with all lines in the IPRSCAN file)

Use of uninitialized value $gene_id in hash element at /usr/local/genome/maker/maker-2.10/bin/ipr_update_gff line 163, <$IN> line 61.

_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org

analysis.fixed.gff (444K) Download Attachment
g03-hmmp-abinit.raw (10K) Download Attachment
g03-all.models.fixed.gff (274K) Download Attachment
g03hmmpmodels.raw (1K) Download Attachment
Reply | Threaded
Open this post in threaded view
|

Re: updating evidence based gff with IPRScan results

Carson Hinton Holt
Re: updating evidence based gff with IPRScan results The script is working with the gene attributes so results where you have interpro domains for match/match_part entries causes the error.  You can use the gff3_preds2models script to convert the set of ab initio prediction entries into gene/mRNA/exon/CDS models.  You just provide a list of what predictions you want to convert (1 ID per line).

--Carson


On 4/12/11 2:18 PM, "Dinatale C" <dinatal@...> wrote:

Hi Carson,

  A little while back you provided me with a script to edit my gff files in order to update them with IPRscan resutls. It seems I am able to do so when only maker models are present in the gff file, however, when I try to update an anlysis/evidence based gff file I recieve the errors below. I have attached the files for you to look at, any help is appreciated. (  i attached all sets of files, those that worked, 'models', and those that did not, 'abinit')

Thanks in advance,
Claudia


ERROR ( is associated with all lines in the IPRSCAN file)
Use of uninitialized value $gene_id in hash element at /usr/local/genome/maker/maker-2.10/bin/ipr_update_gff line 163, <$IN> line 61.


_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
Reply | Threaded
Open this post in threaded view
|

updating gff3 note attribute with wu-blast hits using maker_functional_gff

claudia
Hi,
 I was wondering if anyone can tell me exactly how the maker_functional_gff script works?

 I have attempted to update a gff3 with a  Wu-blast output of maker proteins ( and specifying uniprot-sprot db), as well, I have downloaded the uniprot sprot fasta of the most up to date protein db in fasta format ( not sure if this is the right file to use) and this is what i used as the 'uniprot-sprot fasta', but Im not sure if this is correct.

Basically, the gff3 note attribute gets updated with only "protein of unknown function", yet I know that there are specific protein hits that it is not utilizing, as well these protein hits are located in both the wu-blast output and the uniprot-sprot db fasta file.

Thanks,
Claudia



_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
Reply | Threaded
Open this post in threaded view
|

Re: updating gff3 note attribute with wu-blast hits using maker_functional_gff

Carson Hinton Holt
Re: updating gff3 note attribute with wu-blast hits using maker_functional_gff Usage:

maker_functional_gff uniprot-sprot.fasta uniprot-sprot.blast.out maker.gff

The uniprot-sprot.fasta file is just the raw uniprot fasta file

The uniprot-sprot.blast.out file is the WUBLAST results using –mformat=2 option (tab delimited).  These are blast results for the MAKER proteins against the uniprot/swiss-prot dataset. Ie -->
blastp  uniprot-sprot.fasta  maker_proteins.fasta -mformat-=2 > uniprot-sprot.blast.out
(order of files is important maker_proteins must come second)

maker.gff is just a MAKER produced GFF3 file.


The script takes the best hit for each maker protein and keeps that function.

--Carson


On 5/11/11 11:58 AM, "claudia" <dinatal@...> wrote:

  Hi,
  I was wondering if anyone can tell me exactly how the maker_functional_gff script works?
 
  I have attempted to update a gff3 with a  Wu-blast output of maker proteins ( and specifying uniprot-sprot db), as well, I have downloaded the uniprot sprot fasta of the most up to date protein db in fasta format ( not sure if this is the right file to use) and this is what i used as the 'uniprot-sprot fasta', but Im not sure if this is correct.
 
 Basically, the gff3 note attribute gets updated with only "protein of unknown function", yet I know that there are specific protein hits that it is not utilizing, as well these protein hits are located in both the wu-blast output and the uniprot-sprot db fasta file.
 
 Thanks,
 Claudia
 
 
 

Carson Holt
Graduate Student
Yandell Lab
http:/www.yandell-lab.org/
Eccles Institute of Human Genetics
University of Utah

_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
Reply | Threaded
Open this post in threaded view
|

Re: updating gff3 note attribute with wu-blast hits using maker_functional_gff

claudia
Hi Carson,
 Thanks for the reply, I see how it works now, I was generating a wu-blast output from the actual website.
One thing, if I am using NCBI blast, will this script still work? I think the formating is different from wu-blast.

Thanks, Claudia


On 11/05/2011 2:23 PM, Carson Holt wrote:
Re: updating gff3 note attribute with wu-blast hits using maker_functional_gff Usage:

maker_functional_gff uniprot-sprot.fasta uniprot-sprot.blast.out maker.gff

The uniprot-sprot.fasta file is just the raw uniprot fasta file

The uniprot-sprot.blast.out file is the WUBLAST results using –mformat=2 option (tab delimited).  These are blast results for the MAKER proteins against the uniprot/swiss-prot dataset. Ie -->
blastp  uniprot-sprot.fasta  maker_proteins.fasta -mformat-=2 > uniprot-sprot.blast.out
(order of files is important maker_proteins must come second)

maker.gff is just a MAKER produced GFF3 file.


The script takes the best hit for each maker protein and keeps that function.

--Carson


On 5/11/11 11:58 AM, "claudia" <dinatal@...> wrote:

  Hi,
  I was wondering if anyone can tell me exactly how the maker_functional_gff script works?
 
  I have attempted to update a gff3 with a  Wu-blast output of maker proteins ( and specifying uniprot-sprot db), as well, I have downloaded the uniprot sprot fasta of the most up to date protein db in fasta format ( not sure if this is the right file to use) and this is what i used as the 'uniprot-sprot fasta', but Im not sure if this is correct.
 
 Basically, the gff3 note attribute gets updated with only "protein of unknown function", yet I know that there are specific protein hits that it is not utilizing, as well these protein hits are located in both the wu-blast output and the uniprot-sprot db fasta file.
 
 Thanks,
 Claudia
 
 
 

Carson Holt
Graduate Student
Yandell Lab
http:/www.yandell-lab.org/
Eccles Institute of Human Genetics
University of Utah


_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
Reply | Threaded
Open this post in threaded view
|

Re: updating gff3 note attribute with wu-blast hits using maker_functional_gff

Carson Hinton Holt
Re: updating gff3 note attribute with wu-blast hits using maker_functional_gff You can try NCBI’s tab delimited output format.  You may have to change the order of columns.  First column must be the query, second the hit, and third the score.  I believe all other columns are ignored.

--Carson


On 5/11/11 12:43 PM, "claudia" <dinatal@...> wrote:

 Hi Carson,
  Thanks for the reply, I see how it works now, I was generating a wu-blast output from the actual website.
 One thing, if I am using NCBI blast, will this script still work? I think the formating is different from wu-blast.
 
 Thanks, Claudia
 
 
 On 11/05/2011 2:23 PM, Carson Holt wrote:
Re: updating gff3 note attribute with wu-blast hits using maker_functional_gff Usage:
 
 maker_functional_gff uniprot-sprot.fasta uniprot-sprot.blast.out maker.gff
 
 The uniprot-sprot.fasta file is just the raw uniprot fasta file
 
 The uniprot-sprot.blast.out file is the WUBLAST results using –mformat=2 option (tab delimited).  These are blast results for the MAKER proteins against the uniprot/swiss-prot dataset. Ie -->
  
blastp  uniprot-sprot.fasta  maker_proteins.fasta -mformat-=2 > uniprot-sprot.blast.out
 
(order of files is important maker_proteins must come second)
 
 maker.gff is just a MAKER produced GFF3 file.
 
 
 The script takes the best hit for each maker protein and keeps that function.
 
 --Carson
 
 
 On 5/11/11 11:58 AM, "claudia" <dinatal@...> wrote:
 
  
  Hi,
   I was wondering if anyone can tell me exactly how the maker_functional_gff script works?
  
   I have attempted to update a gff3 with a  Wu-blast output of maker proteins ( and specifying uniprot-sprot db), as well, I have downloaded the uniprot sprot fasta of the most up to date protein db in fasta format ( not sure if this is the right file to use) and this is what i used as the 'uniprot-sprot fasta', but Im not sure if this is correct.
  
  Basically, the gff3 note attribute gets updated with only "protein of unknown function", yet I know that there are specific protein hits that it is not utilizing, as well these protein hits are located in both the wu-blast output and the uniprot-sprot db fasta file.
  
  Thanks,
  Claudia
  
  
  
 

 Carson Holt
 Graduate Student
 Yandell Lab
 http:/www.yandell-lab.org/
 Eccles Institute of Human Genetics
 University of Utah
  

 

Carson Holt
Graduate Student
Yandell Lab
http:/www.yandell-lab.org/
Eccles Institute of Human Genetics
University of Utah

_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
Reply | Threaded
Open this post in threaded view
|

usage of add_utr scripts

claudia

Hi,

  So I have attempted to use the add_utr_to_gff3 script and it seems to return many errors, see below: ( I have attached the gff3 file I was trying to update)


Global symbol "$s" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 26.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 34.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 40.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 48.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 63.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 86.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 87.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 88.
Global symbol "$header" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 93.
Global symbol "$footer" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 96.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 111.
Global symbol "$footer" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 187.
Global symbol "@regions" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 445.
Missing right curly or square bracket at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 489, at end of line
/usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 has too many errors.

so I then tried the other script add_utr_start_stop_codon_gff instead and the script added the features however it seems to change all of my I.D's to an increasing numerical order, see below:


##gff-version 3
contig00005    maker    gene    78618    82008    .    +    .    ID=1;Name=augustus_masked-contig00005-abinit-gene-0.15
contig00005    maker    mRNA    78618    82008    .    +    .    ID=2;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1;Parent=1
contig00005    maker    exon    78618    78709    0.04    +    .    ID=3;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:0;Parent=2
contig00005    maker    exon    79771    79837    0.04    +    .    ID=4;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:1;Parent=2
contig00005    maker    exon    80234    81429    0.04    +    .    ID=5;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:2;Parent=2
contig00005    maker    exon    81587    82008    6726    +    .    ID=6;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:3;Parent=2
contig00005    maker    five_prime_UTR    78618    78709    .    +    .    ID=7;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR1;Parent=2
contig00005    maker    five_prime_UTR    79771    79837    .    +    .    ID=8;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR2;Parent=2
contig00005    maker    five_prime_UTR    80234    80556    .    +    .    ID=9;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR3;Parent=2
contig00005    maker    CDS    80557    81429    .    +    0    ID=10;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:cds:0;Parent=2
contig00005    maker    CDS    81587    81910    .    +    0    ID=11;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:cds:1;Parent=2
contig00005    maker    three_prime_UTR    81911    82008    .    +    .    ID=12;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR4;Parent=2
contig00005    maker    start_codon    80557    80559    .    +    .    ID=13;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:start1;Parent=2
contig00005    maker    stop_codon    81908    81910    .    +    .    ID=14;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:stop2;Parent=2


Does anyone have any idea how to control this so that my I.D's don't change, and/ or how to use the other script without getting the errors?


Thanks in advance,

Claudia


_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org

models.gff (155K) Download Attachment
Reply | Threaded
Open this post in threaded view
|

Re: usage of add_utr scripts

Carson Hinton Holt
Re: [maker-devel] usage of add_utr scripts The add_utr_to_gff3 script should have been pulled out, and somehow was left in the 2.10 release.  The add_utr_start_stop_codon_gff script changes IDs, which normally isn’t too big a problem because chado will also change them (ID’s just need to be unique and are not necessarily informative).

If you need those to be unchanged, I’ll modify the script.  It will be a few days though because I’m defending my thesis on Monday.

You can also use MAKER 2.11-beta.  Named UTR is supposed to be optional in GFF3 format because it is always implicit anyway, but MAKER 2.11 now adds explicit UTR by default to better work with JBrowse.  Just give it the other MAKER’s GFF3 file and turn all the pass options to 1.  Don’t use other input files including repeatmasker options.

Thanks,
Carson

On 5/12/11 11:20 AM, "Dinatale C" <dinatal@...> wrote:

Hi,

  So I have attempted to use the add_utr_to_gff3 script and it seems to return many errors, see below: ( I have attached the gff3 file I was trying to update)

Global symbol "$s" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 26.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 34.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 40.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 48.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 63.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 86.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 87.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 88.
Global symbol "$header" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 93.
Global symbol "$footer" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 96.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 111.
Global symbol "$footer" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 187.
Global symbol "@regions" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 445.
Missing right curly or square bracket at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 489, at end of line
/usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 has too many errors.

so I then tried the other script add_utr_start_stop_codon_gff instead and the script added the features however it seems to change all of my I.D's to an increasing numerical order, see below:


##gff-version 3
contig00005    maker    gene    78618    82008    .    +    .    ID=1;Name=augustus_masked-contig00005-abinit-gene-0.15
contig00005    maker    mRNA    78618    82008    .    +    .    ID=2;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1;Parent=1
contig00005    maker    exon    78618    78709    0.04    +    .    ID=3;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:0;Parent=2
contig00005    maker    exon    79771    79837    0.04    +    .    ID=4;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:1;Parent=2
contig00005    maker    exon    80234    81429    0.04    +    .    ID=5;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:2;Parent=2
contig00005    maker    exon    81587    82008    6726    +    .    ID=6;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:3;Parent=2
contig00005    maker    five_prime_UTR    78618    78709    .    +    .    ID=7;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR1;Parent=2
contig00005    maker    five_prime_UTR    79771    79837    .    +    .    ID=8;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR2;Parent=2
contig00005    maker    five_prime_UTR    80234    80556    .    +    .    ID=9;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR3;Parent=2
contig00005    maker    CDS    80557    81429    .    +    0    ID=10;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:cds:0;Parent=2
contig00005    maker    CDS    81587    81910    .    +    0    ID=11;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:cds:1;Parent=2
contig00005    maker    three_prime_UTR    81911    82008    .    +    .    ID=12;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR4;Parent=2
contig00005    maker    start_codon    80557    80559    .    +    .    ID=13;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:start1;Parent=2
contig00005    maker    stop_codon    81908    81910    .    +    .    ID=14;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:stop2;Parent=2


Does anyone have any idea how to control this so that my I.D's don't change, and/ or how to use the other script without getting the errors?


Thanks in advance,

Claudia



Carson Holt
Graduate Student
Yandell Lab
http:/www.yandell-lab.org/
Eccles Institute of Human Genetics
University of Utah

_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
Reply | Threaded
Open this post in threaded view
|

Re: usage of add_utr scripts

Khan, Anar
Re: [maker-devel] usage of add_utr scripts

Hi Carson

 

I’m running maker version 2.09 and would like to add UTRs to the gff output. I’m getting errors for both the add_utr_start_stop_gff and add_utr_to_gff3 scripts.

 

I see from your previous post that an upgrade is recommended but I don’t think our license includes upgrades.

 

I was just wondering if you have any ideas on how to obtain UTR info using v2.09? The script errors are pasted below.

 

Cheers

Anar

 

add_utr_to_gff3 test_maker.orig.gff

Global symbol "$s" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 26.

Global symbol "@files" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 34.

Global symbol "@files" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 40.

Global symbol "@files" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 48.

Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 63.

Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 86.

Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 87.

Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 88.

Global symbol "$header" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 93.

Global symbol "$footer" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 96.

Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 111.

Global symbol "$footer" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 187.

Global symbol "@regions" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 445.

Missing right curly or square bracket at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 489, at end of line

/usr/local/maker_2.09beta/bin/add_utr_to_gff3 has too many errors.

 

--------------

 

add_utr_start_stop_gff test_maker.orig.gff

Use of uninitialized value in split at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 89.

Use of uninitialized value in substr at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 305.

substr outside of string at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 305.

Use of uninitialized value in substr at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 306.

substr outside of string at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 306.

Use of uninitialized value in print at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 180.

 

 

 

From: [hidden email] [mailto:[hidden email]] On Behalf Of Carson Holt
Sent: Friday, 13 May 2011 5:54 a.m.
To: Dinatale C; [hidden email]
Subject: Re: [maker-devel] usage of add_utr scripts

 

The add_utr_to_gff3 script should have been pulled out, and somehow was left in the 2.10 release.  The add_utr_start_stop_codon_gff script changes IDs, which normally isn’t too big a problem because chado will also change them (ID’s just need to be unique and are not necessarily informative).

If you need those to be unchanged, I’ll modify the script.  It will be a few days though because I’m defending my thesis on Monday.

You can also use MAKER 2.11-beta.  Named UTR is supposed to be optional in GFF3 format because it is always implicit anyway, but MAKER 2.11 now adds explicit UTR by default to better work with JBrowse.  Just give it the other MAKER’s GFF3 file and turn all the pass options to 1.  Don’t use other input files including repeatmasker options.

Thanks,
Carson

On 5/12/11 11:20 AM, "Dinatale C" <dinatal@...> wrote:

Hi,

  So I have attempted to use the add_utr_to_gff3 script and it seems to return many errors, see below: ( I have attached the gff3 file I was trying to update)


Global symbol "$s" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 26.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 34.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 40.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 48.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 63.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 86.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 87.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 88.
Global symbol "$header" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 93.
Global symbol "$footer" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 96.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 111.
Global symbol "$footer" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 187.
Global symbol "@regions" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 445.
Missing right curly or square bracket at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 489, at end of line
/usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 has too many errors.


so I then tried the other script add_utr_start_stop_codon_gff instead and the script added the features however it seems to change all of my I.D's to an increasing numerical order, see below:


##gff-version 3
contig00005    maker    gene    78618    82008    .    +    .    ID=1;Name=augustus_masked-contig00005-abinit-gene-0.15
contig00005    maker    mRNA    78618    82008    .    +    .    ID=2;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1;Parent=1
contig00005    maker    exon    78618    78709    0.04    +    .    ID=3;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:0;Parent=2
contig00005    maker    exon    79771    79837    0.04    +    .    ID=4;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:1;Parent=2
contig00005    maker    exon    80234    81429    0.04    +    .    ID=5;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:2;Parent=2
contig00005    maker    exon    81587    82008    6726    +    .    ID=6;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:3;Parent=2
contig00005    maker    five_prime_UTR    78618    78709    .    +    .    ID=7;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR1;Parent=2
contig00005    maker    five_prime_UTR    79771    79837    .    +    .    ID=8;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR2;Parent=2
contig00005    maker    five_prime_UTR    80234    80556    .    +    .    ID=9;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR3;Parent=2
contig00005    maker    CDS    80557    81429    .    +    0    ID=10;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:cds:0;Parent=2
contig00005    maker    CDS    81587    81910    .    +    0    ID=11;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:cds:1;Parent=2
contig00005    maker    three_prime_UTR    81911    82008    .    +    .    ID=12;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR4;Parent=2
contig00005    maker    start_codon    80557    80559    .    +    .    ID=13;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:start1;Parent=2
contig00005    maker    stop_codon    81908    81910    .    +    .    ID=14;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:stop2;Parent=2



Does anyone have any idea how to control this so that my I.D's don't change, and/ or how to use the other script without getting the errors?


Thanks in advance,

Claudia


Carson Holt
Graduate Student
Yandell Lab
http:/www.yandell-lab.org/
Eccles Institute of Human Genetics
University of Utah

 


Attention: The information contained in this message and/or attachments from AgResearch Limited is intended only for the persons or entities to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipients is prohibited by AgResearch Limited. If you have received this message in error, please notify the sender immediately.


 


_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
Reply | Threaded
Open this post in threaded view
|

Re: usage of add_utr scripts

Marvin B Moore
<base href="x-msg://63/">Hi Anar,

I looks like the add_utr_to_gff3 script is completely hosed in that release.  I think it's just there as a relic and add_utr_start_stop_gff is the script that you want.  From those errors it looks like your gff3 file doesn't have the fasta sequence at the end of it. 

If I've guessed correctly, the quick workaround would be to just add lines like the following to the end of your GFF3 file - where ,of course, the fasta header and sequence are replaced with the correct data for the chromosome(s)/contig(s) represented in your GFF3 file. 

##FASTA
>chr1
ATCGATACGTAG...

B

On Sep 13, 2011, at 5:22 PM, Khan, Anar wrote:

Hi Carson
 
I’m running maker version 2.09 and would like to add UTRs to the gff output. I’m getting errors for both the add_utr_start_stop_gff and add_utr_to_gff3 scripts.
 
I see from your previous post that an upgrade is recommended but I don’t think our license includes upgrades.
 
I was just wondering if you have any ideas on how to obtain UTR info using v2.09? The script errors are pasted below.
 
Cheers
Anar
 
add_utr_to_gff3 test_maker.orig.gff
Global symbol "$s" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 26.
Global symbol "@files" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 34.
Global symbol "@files" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 40.
Global symbol "@files" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 48.
Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 63.
Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 86.
Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 87.
Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 88.
Global symbol "$header" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 93.
Global symbol "$footer" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 96.
Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 111.
Global symbol "$footer" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 187.
Global symbol "@regions" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 445.
Missing right curly or square bracket at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 489, at end of line
/usr/local/maker_2.09beta/bin/add_utr_to_gff3 has too many errors.
 
--------------
 
add_utr_start_stop_gff test_maker.orig.gff
Use of uninitialized value in split at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 89.
Use of uninitialized value in substr at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 305.
substr outside of string at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 305.
Use of uninitialized value in substr at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 306.
substr outside of string at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 306.
Use of uninitialized value in print at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 180.
 
 
 
From: [hidden email] [mailto:[hidden email]] On Behalf Of Carson Holt
Sent: Friday, 13 May 2011 5:54 a.m.
To: Dinatale C; [hidden email]
Subject: Re: [maker-devel] usage of add_utr scripts
 

The add_utr_to_gff3 script should have been pulled out, and somehow was left in the 2.10 release.  The add_utr_start_stop_codon_gff script changes IDs, which normally isn’t too big a problem because chado will also change them (ID’s just need to be unique and are not necessarily informative).

If you need those to be unchanged, I’ll modify the script.  It will be a few days though because I’m defending my thesis on Monday.

You can also use MAKER 2.11-beta.  Named UTR is supposed to be optional in GFF3 format because it is always implicit anyway, but MAKER 2.11 now adds explicit UTR by default to better work with JBrowse.  Just give it the other MAKER’s GFF3 file and turn all the pass options to 1.  Don’t use other input files including repeatmasker options.

Thanks,
Carson

On 5/12/11 11:20 AM, "Dinatale C" <dinatal@...> wrote:

Hi,

  So I have attempted to use the add_utr_to_gff3 script and it seems to return many errors, see below: ( I have attached the gff3 file I was trying to update)

Global symbol "$s" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 26.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 34.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 40.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 48.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 63.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 86.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 87.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 88.
Global symbol "$header" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 93.
Global symbol "$footer" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 96.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 111.
Global symbol "$footer" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 187.
Global symbol "@regions" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 445.
Missing right curly or square bracket at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 489, at end of line
/usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 has too many errors.

so I then tried the other script add_utr_start_stop_codon_gff instead and the script added the features however it seems to change all of my I.D's to an increasing numerical order, see below:


##gff-version 3
contig00005    maker    gene    78618    82008    .    +    .    ID=1;Name=augustus_masked-contig00005-abinit-gene-0.15
contig00005    maker    mRNA    78618    82008    .    +    .    ID=2;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1;Parent=1
contig00005    maker    exon    78618    78709    0.04    +    .    ID=3;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:0;Parent=2
contig00005    maker    exon    79771    79837    0.04    +    .    ID=4;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:1;Parent=2
contig00005    maker    exon    80234    81429    0.04    +    .    ID=5;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:2;Parent=2
contig00005    maker    exon    81587    82008    6726    +    .    ID=6;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:3;Parent=2
contig00005    maker    five_prime_UTR    78618    78709    .    +    .    ID=7;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR1;Parent=2
contig00005    maker    five_prime_UTR    79771    79837    .    +    .    ID=8;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR2;Parent=2
contig00005    maker    five_prime_UTR    80234    80556    .    +    .    ID=9;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR3;Parent=2
contig00005    maker    CDS    80557    81429    .    +    0    ID=10;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:cds:0;Parent=2
contig00005    maker    CDS    81587    81910    .    +    0    ID=11;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:cds:1;Parent=2
contig00005    maker    three_prime_UTR    81911    82008    .    +    .    ID=12;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR4;Parent=2
contig00005    maker    start_codon    80557    80559    .    +    .    ID=13;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:start1;Parent=2
contig00005    maker    stop_codon    81908    81910    .    +    .    ID=14;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:stop2;Parent=2


Does anyone have any idea how to control this so that my I.D's don't change, and/ or how to use the other script without getting the errors?


Thanks in advance,

Claudia


Carson Holt
Graduate Student
Yandell Lab
http:/www.yandell-lab.org/
Eccles Institute of Human Genetics
University of Utah
 



Attention: The information contained in this message and/or attachments from AgResearch Limited is intended only for the persons or entities to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipients is prohibited by AgResearch Limited. If you have received this message in error, please notify the sender immediately.




 
<ATT00001..txt>

Barry Moore
Research Scientist
Dept. of Human Genetics
University of Utah
Salt Lake City, UT 84112
--------------------------------------------
(801) 585-3543





_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
Reply | Threaded
Open this post in threaded view
|

Re: usage of add_utr scripts

Carson Hinton Holt
2.11 and above no longer have those scripts (UTR is always added by default now).  So this is a round about way to do it --> upgrade MAKER, then provide the merged GFF3 (made with gff3_merge) to genome_gff in the makeropts.ctl file, then set all pass options to 1.  Then make sure to clear all other options (repeatprotein model_org etc.).  When you rerun MAKER, it will just update the GFF3.

--Carson



From: Barry Moore <[hidden email]>
Date: Tue, 13 Sep 2011 18:41:16 -0600
To: "Khan, Anar" <[hidden email]>
Cc: Carson Holt <[hidden email]>, Dinatale C <[hidden email]>, "[hidden email]" <[hidden email]>
Subject: Re: [maker-devel] usage of add_utr scripts

<base href="x-msg://63/">
Hi Anar,

I looks like the add_utr_to_gff3 script is completely hosed in that release.  I think it's just there as a relic and add_utr_start_stop_gff is the script that you want.  From those errors it looks like your gff3 file doesn't have the fasta sequence at the end of it. 

If I've guessed correctly, the quick workaround would be to just add lines like the following to the end of your GFF3 file - where ,of course, the fasta header and sequence are replaced with the correct data for the chromosome(s)/contig(s) represented in your GFF3 file. 

##FASTA
>chr1
ATCGATACGTAG...

B

On Sep 13, 2011, at 5:22 PM, Khan, Anar wrote:

Hi Carson
 
I’m running maker version 2.09 and would like to add UTRs to the gff output. I’m getting errors for both the add_utr_start_stop_gff and add_utr_to_gff3 scripts.
 
I see from your previous post that an upgrade is recommended but I don’t think our license includes upgrades.
 
I was just wondering if you have any ideas on how to obtain UTR info using v2.09? The script errors are pasted below.
 
Cheers
Anar
 
add_utr_to_gff3 test_maker.orig.gff
Global symbol "$s" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 26.
Global symbol "@files" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 34.
Global symbol "@files" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 40.
Global symbol "@files" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 48.
Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 63.
Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 86.
Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 87.
Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 88.
Global symbol "$header" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 93.
Global symbol "$footer" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 96.
Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 111.
Global symbol "$footer" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 187.
Global symbol "@regions" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 445.
Missing right curly or square bracket at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 489, at end of line
/usr/local/maker_2.09beta/bin/add_utr_to_gff3 has too many errors.
 
--------------
 
add_utr_start_stop_gff test_maker.orig.gff
Use of uninitialized value in split at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 89.
Use of uninitialized value in substr at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 305.
substr outside of string at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 305.
Use of uninitialized value in substr at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 306.
substr outside of string at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 306.
Use of uninitialized value in print at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 180.
 
 
 
From: [hidden email] [[hidden email]] On Behalf Of Carson Holt
Sent: Friday, 13 May 2011 5:54 a.m.
To: Dinatale C; [hidden email]
Subject: Re: [maker-devel] usage of add_utr scripts
 

The add_utr_to_gff3 script should have been pulled out, and somehow was left in the 2.10 release.  The add_utr_start_stop_codon_gff script changes IDs, which normally isn’t too big a problem because chado will also change them (ID’s just need to be unique and are not necessarily informative).

If you need those to be unchanged, I’ll modify the script.  It will be a few days though because I’m defending my thesis on Monday.

You can also use MAKER 2.11-beta.  Named UTR is supposed to be optional in GFF3 format because it is always implicit anyway, but MAKER 2.11 now adds explicit UTR by default to better work with JBrowse.  Just give it the other MAKER’s GFF3 file and turn all the pass options to 1.  Don’t use other input files including repeatmasker options.

Thanks,
Carson

On 5/12/11 11:20 AM, "Dinatale C" <dinatal@...> wrote:

Hi,

  So I have attempted to use the add_utr_to_gff3 script and it seems to return many errors, see below: ( I have attached the gff3 file I was trying to update)

Global symbol "$s" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 26.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 34.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 40.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 48.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 63.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 86.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 87.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 88.
Global symbol "$header" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 93.
Global symbol "$footer" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 96.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 111.
Global symbol "$footer" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 187.
Global symbol "@regions" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 445.
Missing right curly or square bracket at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 489, at end of line
/usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 has too many errors.

so I then tried the other script add_utr_start_stop_codon_gff instead and the script added the features however it seems to change all of my I.D's to an increasing numerical order, see below:


##gff-version 3
contig00005    maker    gene    78618    82008    .    +    .    ID=1;Name=augustus_masked-contig00005-abinit-gene-0.15
contig00005    maker    mRNA    78618    82008    .    +    .    ID=2;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1;Parent=1
contig00005    maker    exon    78618    78709    0.04    +    .    ID=3;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:0;Parent=2
contig00005    maker    exon    79771    79837    0.04    +    .    ID=4;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:1;Parent=2
contig00005    maker    exon    80234    81429    0.04    +    .    ID=5;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:2;Parent=2
contig00005    maker    exon    81587    82008    6726    +    .    ID=6;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:3;Parent=2
contig00005    maker    five_prime_UTR    78618    78709    .    +    .    ID=7;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR1;Parent=2
contig00005    maker    five_prime_UTR    79771    79837    .    +    .    ID=8;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR2;Parent=2
contig00005    maker    five_prime_UTR    80234    80556    .    +    .    ID=9;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR3;Parent=2
contig00005    maker    CDS    80557    81429    .    +    0    ID=10;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:cds:0;Parent=2
contig00005    maker    CDS    81587    81910    .    +    0    ID=11;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:cds:1;Parent=2
contig00005    maker    three_prime_UTR    81911    82008    .    +    .    ID=12;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR4;Parent=2
contig00005    maker    start_codon    80557    80559    .    +    .    ID=13;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:start1;Parent=2
contig00005    maker    stop_codon    81908    81910    .    +    .    ID=14;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:stop2;Parent=2


Does anyone have any idea how to control this so that my I.D's don't change, and/ or how to use the other script without getting the errors?


Thanks in advance,

Claudia


Carson Holt
Graduate Student
Yandell Lab
http:/www.yandell-lab.org/
Eccles Institute of Human Genetics
University of Utah
 



Attention: The information contained in this message and/or attachments from AgResearch Limited is intended only for the persons or entities to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipients is prohibited by AgResearch Limited. If you have received this message in error, please notify the sender immediately.




 
<ATT00001..txt>

Barry Moore
Research Scientist
Dept. of Human Genetics
University of Utah
Salt Lake City, UT 84112
--------------------------------------------
(801) 585-3543





_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
Reply | Threaded
Open this post in threaded view
|

Re: usage of add_utr scripts

Khan, Anar
<base href="x-msg://63/">

Hi

 

Thank you both for your replies.

 

Barry, you reminded me that I had in fact meddled with the gff output, so what I was passing to the add_utr scripts wasn’t actually bona fide MAKER output. I had stripped the fasta from the gff file, for some reason. Running add_utr_start_stop_gff on the original MAKER gff worked without errors. My mistake, I apologise! Thank you for picking this up.

 

Carson, thanks for explaining the workaround. We can’t afford to pay for an upgrade right now, but will obtain the new version when our current license expires and we renew. I’m happy to hear MAKER now adds UTRs by default.

 

Cheers

Anar

 

 

From: Carson Holt [mailto:[hidden email]]
Sent: Wednesday, 14 September 2011 2:02 p.m.
To: Barry Moore; Khan, Anar
Cc: Dinatale C; [hidden email]
Subject: Re: [maker-devel] usage of add_utr scripts

 

2.11 and above no longer have those scripts (UTR is always added by default now).  So this is a round about way to do it --> upgrade MAKER, then provide the merged GFF3 (made with gff3_merge) to genome_gff in the makeropts.ctl file, then set all pass options to 1.  Then make sure to clear all other options (repeatprotein model_org etc.).  When you rerun MAKER, it will just update the GFF3.

 

--Carson

 

 

 

From: Barry Moore <[hidden email]>
Date: Tue, 13 Sep 2011 18:41:16 -0600
To: "Khan, Anar" <[hidden email]>
Cc: Carson Holt <[hidden email]>, Dinatale C <[hidden email]>, "[hidden email]" <[hidden email]>
Subject: Re: [maker-devel] usage of add_utr scripts

 

Hi Anar,

 

I looks like the add_utr_to_gff3 script is completely hosed in that release.  I think it's just there as a relic and add_utr_start_stop_gff is the script that you want.  From those errors it looks like your gff3 file doesn't have the fasta sequence at the end of it. 

 

If I've guessed correctly, the quick workaround would be to just add lines like the following to the end of your GFF3 file - where ,of course, the fasta header and sequence are replaced with the correct data for the chromosome(s)/contig(s) represented in your GFF3 file. 

 

##FASTA

>chr1

ATCGATACGTAG...

 

B

 

On Sep 13, 2011, at 5:22 PM, Khan, Anar wrote:



Hi Carson

 

I’m running maker version 2.09 and would like to add UTRs to the gff output. I’m getting errors for both the add_utr_start_stop_gff and add_utr_to_gff3 scripts.

 

I see from your previous post that an upgrade is recommended but I don’t think our license includes upgrades.

 

I was just wondering if you have any ideas on how to obtain UTR info using v2.09? The script errors are pasted below.

 

Cheers

Anar

 

add_utr_to_gff3 test_maker.orig.gff

Global symbol "$s" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 26.

Global symbol "@files" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 34.

Global symbol "@files" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 40.

Global symbol "@files" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 48.

Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 63.

Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 86.

Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 87.

Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 88.

Global symbol "$header" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 93.

Global symbol "$footer" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 96.

Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 111.

Global symbol "$footer" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 187.

Global symbol "@regions" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 445.

Missing right curly or square bracket at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 489, at end of line

/usr/local/maker_2.09beta/bin/add_utr_to_gff3 has too many errors.

 

--------------

 

add_utr_start_stop_gff test_maker.orig.gff

Use of uninitialized value in split at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 89.

Use of uninitialized value in substr at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 305.

substr outside of string at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 305.

Use of uninitialized value in substr at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 306.

substr outside of string at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 306.

Use of uninitialized value in print at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 180.

 

 

 

From: [hidden email] [[hidden email]] On Behalf Of Carson Holt
Sent: Friday, 13 May 2011 5:54 a.m.
To: Dinatale C; [hidden email]
Subject: Re: [maker-devel] usage of add_utr scripts

 

The add_utr_to_gff3 script should have been pulled out, and somehow was left in the 2.10 release.  The add_utr_start_stop_codon_gff script changes IDs, which normally isn’t too big a problem because chado will also change them (ID’s just need to be unique and are not necessarily informative).

If you need those to be unchanged, I’ll modify the script.  It will be a few days though because I’m defending my thesis on Monday.

You can also use MAKER 2.11-beta.  Named UTR is supposed to be optional in GFF3 format because it is always implicit anyway, but MAKER 2.11 now adds explicit UTR by default to better work with JBrowse.  Just give it the other MAKER’s GFF3 file and turn all the pass options to 1.  Don’t use other input files including repeatmasker options.

Thanks,
Carson

On 5/12/11 11:20 AM, "Dinatale C" <<a href="x-msg://63/dinatal@uwindsor.ca">dinatal@...> wrote:

Hi,

  So I have attempted to use the add_utr_to_gff3 script and it seems to return many errors, see below: ( I have attached the gff3 file I was trying to update)


Global symbol "$s" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 26.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 34.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 40.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 48.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 63.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 86.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 87.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 88.
Global symbol "$header" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 93.
Global symbol "$footer" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 96.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 111.
Global symbol "$footer" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 187.
Global symbol "@regions" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 445.
Missing right curly or square bracket at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 489, at end of line
/usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 has too many errors.


so I then tried the other script add_utr_start_stop_codon_gff instead and the script added the features however it seems to change all of my I.D's to an increasing numerical order, see below:


##gff-version 3
contig00005    maker    gene    78618    82008    .    +    .    ID=1;Name=augustus_masked-contig00005-abinit-gene-0.15
contig00005    maker    mRNA    78618    82008    .    +    .    ID=2;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1;Parent=1
contig00005    maker    exon    78618    78709    0.04    +    .    ID=3;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:0;Parent=2
contig00005    maker    exon    79771    79837    0.04    +    .    ID=4;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:1;Parent=2
contig00005    maker    exon    80234    81429    0.04    +    .    ID=5;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:2;Parent=2
contig00005    maker    exon    81587    82008    6726    +    .    ID=6;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:3;Parent=2
contig00005    maker    five_prime_UTR    78618    78709    .    +    .    ID=7;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR1;Parent=2
contig00005    maker    five_prime_UTR    79771    79837    .    +    .    ID=8;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR2;Parent=2
contig00005    maker    five_prime_UTR    80234    80556    .    +    .    ID=9;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR3;Parent=2
contig00005    maker    CDS    80557    81429    .    +    0    ID=10;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:cds:0;Parent=2
contig00005    maker    CDS    81587    81910    .    +    0    ID=11;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:cds:1;Parent=2
contig00005    maker    three_prime_UTR    81911    82008    .    +    .    ID=12;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR4;Parent=2
contig00005    maker    start_codon    80557    80559    .    +    .    ID=13;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:start1;Parent=2
contig00005    maker    stop_codon    81908    81910    .    +    .    ID=14;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:stop2;Parent=2



Does anyone have any idea how to control this so that my I.D's don't change, and/ or how to use the other script without getting the errors?


Thanks in advance,

Claudia



Carson Holt
Graduate Student
Yandell Lab
http:/www.yandell-lab.org/
Eccles Institute of Human Genetics
University of Utah

 

 


 

Attention: The information contained in this message and/or attachments from AgResearch Limited is intended only for the persons or entities to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipients is prohibited by AgResearch Limited. If you have received this message in error, please notify the sender immediately.

 


 

 

<ATT00001..txt>

 

Barry Moore

Research Scientist

Dept. of Human Genetics

University of Utah

Salt Lake City, UT 84112

--------------------------------------------

(801) 585-3543

 

 

 

 


_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
Reply | Threaded
Open this post in threaded view
|

Re: usage of add_utr scripts

Khan, Anar
In reply to this post by Carson Hinton Holt
<base href="x-msg://63/">

Hi

 

I apologise for asking another question about this script given it’s been deprecated, but I am curious about something and wonder whether you might be able to help. I ran add_utr_start_stop_gff on some maker output and no UTRs were added. For this species I have no est data, only altest data. Does this explain why no UTRs could be identified? What info is used to add UTR coordinates?

 

Thanks very much,

Anar

 

 

From: Carson Holt [mailto:[hidden email]]
Sent: Wednesday, 14 September 2011 2:02 p.m.
To: Barry Moore; Khan, Anar
Cc: Dinatale C; [hidden email]
Subject: Re: [maker-devel] usage of add_utr scripts

 

2.11 and above no longer have those scripts (UTR is always added by default now).  So this is a round about way to do it --> upgrade MAKER, then provide the merged GFF3 (made with gff3_merge) to genome_gff in the makeropts.ctl file, then set all pass options to 1.  Then make sure to clear all other options (repeatprotein model_org etc.).  When you rerun MAKER, it will just update the GFF3.

 

--Carson

 

 

 

From: Barry Moore <[hidden email]>
Date: Tue, 13 Sep 2011 18:41:16 -0600
To: "Khan, Anar" <[hidden email]>
Cc: Carson Holt <[hidden email]>, Dinatale C <[hidden email]>, "[hidden email]" <[hidden email]>
Subject: Re: [maker-devel] usage of add_utr scripts

 

Hi Anar,

 

I looks like the add_utr_to_gff3 script is completely hosed in that release.  I think it's just there as a relic and add_utr_start_stop_gff is the script that you want.  From those errors it looks like your gff3 file doesn't have the fasta sequence at the end of it. 

 

If I've guessed correctly, the quick workaround would be to just add lines like the following to the end of your GFF3 file - where ,of course, the fasta header and sequence are replaced with the correct data for the chromosome(s)/contig(s) represented in your GFF3 file. 

 

##FASTA

>chr1

ATCGATACGTAG...

 

B

 

 


Attention: The information contained in this message and/or attachments from AgResearch Limited is intended only for the persons or entities to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipients is prohibited by AgResearch Limited. If you have received this message in error, please notify the sender immediately.


 


_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
Reply | Threaded
Open this post in threaded view
|

Re: usage of add_utr scripts

Carson Hinton Holt
In reply to this post by claudia
Yes.  That would be why.

--Carson

From: "Khan, Anar" <[hidden email]>
Date: Wed, 28 Sep 2011 18:59:04 -0600
To: Carson Holt <[hidden email]>, Barry Moore <[hidden email]>
Cc: Dinatale C <[hidden email]>, "[hidden email]" <[hidden email]>
Subject: RE: [maker-devel] usage of add_utr scripts

<base href="x-msg://63/">

Hi

 

I ran add_utr_start_stop_gff on some maker output and no UTRs were added. For this species I have no est data, only altest data. Does this explain why no UTRs could be identified –perhaps altest evidence isn’t used when attempting to define UTRs?

 

Cheers :-)

Anar

 

From: Khan, Anar
Sent: Wednesday, 14 September 2011 4:19 p.m.
To: 'Carson Holt'; Barry Moore
Cc: Dinatale C; [hidden email]
Subject: RE: [maker-devel] usage of add_utr scripts

 

Hi

 

Thank you both for your replies.

 

Barry, you reminded me that I had in fact meddled with the gff output, so what I was passing to the add_utr scripts wasn’t actually bona fide MAKER output. I had stripped the fasta from the gff file, for some reason. Running add_utr_start_stop_gff on the original MAKER gff worked without errors. My mistake, I apologise! Thank you for picking this up.

 

Carson, thanks for explaining the workaround. We can’t afford to pay for an upgrade right now, but will obtain the new version when our current license expires and we renew. I’m happy to hear MAKER nowadds UTRs by default.

 

Cheers

Anar

 

 

From: Carson Holt [[hidden email]]
Sent: Wednesday, 14 September 2011 2:02 p.m.
To: Barry Moore; Khan, Anar
Cc: Dinatale C; [hidden email]
Subject: Re: [maker-devel] usage of add_utr scripts

 

2.11 and above no longer have those scripts (UTR is always added by default now).  So this is a round about way to do it --> upgradeMAKER, then provide the merged GFF3 (made with gff3_merge) to genome_gff in the makeropts.ctl file, then set all pass options to 1.  Then make sure to clear all other options (repeatprotein model_org etc.).  When you rerun MAKER, it will just update the GFF3.

 

--Carson

 

 

 

From: Barry Moore <[hidden email]>
Date: Tue, 13 Sep 2011 18:41:16 -0600
To: "Khan, Anar" <[hidden email]>
Cc: Carson Holt <[hidden email]>, Dinatale C <[hidden email]>, "[hidden email]" <[hidden email]>
Subject: Re: [maker-devel] usage of add_utr scripts

 

Hi Anar,

 

I looks like the add_utr_to_gff3 script is completely hosed in that release.  I think it's just there as a relic and add_utr_start_stop_gff is the script that you want.  From those errors it looks like your gff3 file doesn't have the fasta sequence at the end of it. 

 

If I've guessed correctly, the quick workaround would be to just add lines like the following to the end of your GFF3 file - where ,of course, the fasta header and sequence are replaced with the correct data for the chromosome(s)/contig(s) represented in your GFF3 file. 

 

##FASTA

>chr1

ATCGATACGTAG...

 

B

 

On Sep 13, 2011, at 5:22 PM, Khan, Anar wrote:

 

Hi Carson

 

I’m running maker version 2.09 and would like to add UTRs to the gff output. I’m getting errors for both the add_utr_start_stop_gff and add_utr_to_gff3 scripts.

 

I see from your previous post that an upgrade is recommended but I don’t think our license includes upgrades.

 

I was just wondering if you have any ideas on how to obtain UTR info using v2.09? The script errors are pasted below.

 

Cheers

Anar

 

add_utr_to_gff3 test_maker.orig.gff

Global symbol "$s" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 26.

Global symbol "@files" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 34.

Global symbol "@files" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 40.

Global symbol "@files" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 48.

Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 63.

Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 86.

Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 87.

Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 88.

Global symbol "$header" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 93.

Global symbol "$footer" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 96.

Global symbol "@stuff" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 111.

Global symbol "$footer" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 187.

Global symbol "@regions" requires explicit package name at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 445.

Missing right curly or square bracket at /usr/local/maker_2.09beta/bin/add_utr_to_gff3 line 489, at end of line

/usr/local/maker_2.09beta/bin/add_utr_to_gff3 has too many errors.

 

--------------

 

add_utr_start_stop_gff test_maker.orig.gff

Use of uninitialized value in split at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 89.

Use of uninitialized value in substr at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 305.

substr outside of string at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 305.

Use of uninitialized value in substr at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 306.

substr outside of string at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 306.

Use of uninitialized value in print at /usr/local/maker_2.09beta/bin/add_utr_start_stop_gff line 180.

 

 

 

From: [hidden email] [[hidden email]] On Behalf Of Carson Holt
Sent: Friday, 13 May 2011 5:54 a.m.
To: Dinatale C; [hidden email]
Subject: Re: [maker-devel] usage of add_utr scripts

 

The add_utr_to_gff3 script should have been pulled out, and somehow was left in the 2.10 release.  The add_utr_start_stop_codon_gff script changes IDs, which normally isn’t too big a problem because chado will also change them (ID’s just need to be unique and are not necessarily informative).

If you need those to be unchanged, I’ll modify the script.  It will be a few days though because I’m defending my thesis on Monday.

You can also use MAKER 2.11-beta.  Named UTR is supposed to be optional in GFF3 format because it is always implicit anyway, but MAKER 2.11 now adds explicit UTR by default to better work with JBrowse.  Just give it theother MAKER’s GFF3 file and turn all the pass options to 1.  Don’t use other input files including repeatmasker options.

Thanks,
Carson

On 5/12/11 11:20 AM, "Dinatale C" <<a href="x-msg://63/dinatal@uwindsor.ca">dinatal@...> wrote:

Hi,

  So I have attempted to use the add_utr_to_gff3 script and it seems to return many errors, see below: ( I have attached the gff3 file I was trying to update)


Global symbol "$s" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 26.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 34.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 40.
Global symbol "@files" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 48.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 63.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 86.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 87.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 88.
Global symbol "$header" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 93.
Global symbol "$footer" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 96.
Global symbol "@stuff" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 111.
Global symbol "$footer" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 187.
Global symbol "@regions" requires explicit package name at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 445.
Missing right curly or square bracket at /usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 line 489, at end of line
/usr/local/genome/maker/maker-2.10/bin/add_utr_to_gff3 has too many errors.


so I then tried the other script add_utr_start_stop_codon_gff instead and the script added the features however it seems to change all of my I.D's to an increasing numerical order, see below:


##gff-version 3
contig00005    maker    gene    78618    82008    .    +    .    ID=1;Name=augustus_masked-contig00005-abinit-gene-0.15
contig00005    maker    mRNA    78618    82008    .    +    .    ID=2;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1;Parent=1
contig00005    maker    exon    78618    78709    0.04    +    .    ID=3;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:0;Parent=2
contig00005    maker    exon    79771    79837    0.04    +    .    ID=4;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:1;Parent=2
contig00005    maker    exon    80234    81429    0.04    +    .    ID=5;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:2;Parent=2
contig00005    maker    exon    81587    82008    6726    +    .    ID=6;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:exon:3;Parent=2
contig00005    maker    five_prime_UTR    78618    78709    .    +    .    ID=7;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR1;Parent=2
contig00005    maker    five_prime_UTR    79771    79837    .    +    .    ID=8;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR2;Parent=2
contig00005    maker    five_prime_UTR    80234    80556    .    +    .    ID=9;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR3;Parent=2
contig00005    maker    CDS    80557    81429    .    +    0    ID=10;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:cds:0;Parent=2
contig00005    maker    CDS    81587    81910    .    +    0    ID=11;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:cds:1;Parent=2
contig00005    maker    three_prime_UTR    81911    82008    .    +    .    ID=12;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:UTR4;Parent=2
contig00005    maker    start_codon    80557    80559    .    +    .    ID=13;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:start1;Parent=2
contig00005    maker    stop_codon    81908    81910    .    +    .    ID=14;Name=augustus_masked-contig00005-abinit-gene-0.15-mRNA-1:stop2;Parent=2



Does anyone have any idea how to control this so that my I.D's don't change, and/ or how to use the other script without getting the errors?


Thanks in advance,

Claudia


Carson Holt
Graduate Student
Yandell Lab
http:/www.yandell-lab.org/
Eccles Institute of Human Genetics
University of Utah

 

 


 

Attention: The information contained in this message and/or attachments from AgResearch Limited is intended only for the persons or entities to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipients is prohibited by AgResearch Limited. If you have received this message in error, please notify the sender immediately.

 


 

 

<ATT00001..txt>

 

Barry Moore

Research Scientist

Dept. of Human Genetics

University of Utah

Salt Lake City, UT 84112

--------------------------------------------

(801) 585-3543

 

 

 

 


_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
Reply | Threaded
Open this post in threaded view
|

Re: usage of add_utr scripts

Carson Hinton Holt
In reply to this post by Khan, Anar
ESTs that overlap the splice sites of the prediction are used to extend the gene model.  You can think of it as a kind of cut and paste operation.  Alt_est does not match the splice sites correctly (there are several technical reasons why).  Also since  alt_est is aligned in protein space (tblastx) the UTR region may not line up as it is non-coding, so it is not under the same evolutionary constraints as the rest of the alignment.

Thanks,
Carson


From: "Khan, Anar" <[hidden email]>
Date: Wed, 28 Sep 2011 19:07:53 -0600
To: Carson Holt <[hidden email]>, Barry Moore <[hidden email]>
Cc: Dinatale C <[hidden email]>, "[hidden email]" <[hidden email]>
Subject: RE: [maker-devel] usage of add_utr scripts

<base href="x-msg://63/">

Hi

 

I apologise for asking another question about this script given it’s been deprecated, but I am curious about something and wonder whether you might be able to help. I ran add_utr_start_stop_gff on some maker output and no UTRs were added. For this species I have no est data, only altest data. Does this explain why no UTRs could be identified? What info is used to add UTR coordinates?

 

Thanks very much,

Anar

 

 

From: Carson Holt [[hidden email]]
Sent: Wednesday, 14 September 2011 2:02 p.m.
To: Barry Moore; Khan, Anar
Cc: Dinatale C; [hidden email]
Subject: Re: [maker-devel] usage of add_utr scripts

 

2.11 and above no longer have those scripts (UTR is always added by default now).  So this is a round about way to do it --> upgradeMAKER, then provide the merged GFF3 (made with gff3_merge) to genome_gff in the makeropts.ctl file, then set all pass options to 1.  Then make sure to clear all other options (repeatprotein model_org etc.).  When you rerun MAKER, it will just update the GFF3.

 

--Carson

 

 

 

From: Barry Moore <[hidden email]>
Date: Tue, 13 Sep 2011 18:41:16 -0600
To: "Khan, Anar" <[hidden email]>
Cc: Carson Holt <[hidden email]>, Dinatale C <[hidden email]>, "[hidden email]" <[hidden email]>
Subject: Re: [maker-devel] usage of add_utr scripts

 

Hi Anar,

 

I looks like the add_utr_to_gff3 script is completely hosed in that release.  I think it's just there as a relic and add_utr_start_stop_gff is the script that you want.  From those errors it looks like your gff3 file doesn't have the fasta sequence at the end of it. 

 

If I've guessed correctly, the quick workaround would be to just add lines like the following to the end of your GFF3 file - where ,of course, the fasta header and sequence are replaced with the correct data for the chromosome(s)/contig(s) represented in your GFF3 file. 

 

##FASTA

>chr1

ATCGATACGTAG...

 

B

 

 


Attention: The information contained in this message and/or attachments from AgResearch Limited is intended only for the persons or entities to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipients is prohibited by AgResearch Limited. If you have received this message in error, please notify the sender immediately.


 


_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
Reply | Threaded
Open this post in threaded view
|

Re: usage of add_utr scripts

Khan, Anar
<base href="x-msg://63/">

Hi Carson

 

Thanks a lot for the explanation, all clear now.

 

Cheers

Anar

 

From: Carson Holt [mailto:[hidden email]]
Sent: Thursday, 29 September 2011 2:13 p.m.
To: Khan, Anar; Barry Moore
Cc: Dinatale C; [hidden email]
Subject: Re: [maker-devel] usage of add_utr scripts

 

ESTs that overlap the splice sites of the prediction are used to extend the gene model.  You can think of it as a kind of cut and paste operation.  Alt_est does not match the splice sites correctly (there are several technical reasons why).  Also since  alt_est is aligned in protein space (tblastx) the UTR region may not line up as it is non-coding, so it is not under the same evolutionary constraints as the rest of the alignment.

 

Thanks,

Carson

 

 

From: "Khan, Anar" <[hidden email]>
Date: Wed, 28 Sep 2011 19:07:53 -0600
To: Carson Holt <[hidden email]>, Barry Moore <[hidden email]>
Cc: Dinatale C <[hidden email]>, "[hidden email]" <[hidden email]>
Subject: RE: [maker-devel] usage of add_utr scripts

 

Hi

 

I apologise for asking another question about this script given it’s been deprecated, but I am curious about something and wonder whether you might be able to help. I ran add_utr_start_stop_gff on some maker output and no UTRs were added. For this species I have no est data, only altest data. Does this explain why no UTRs could be identified? What info is used to add UTR coordinates?

 

Thanks very much,

Anar

 

 

From: Carson Holt [[hidden email]]
Sent: Wednesday, 14 September 2011 2:02 p.m.
To: Barry Moore; Khan, Anar
Cc: Dinatale C; [hidden email]
Subject: Re: [maker-devel] usage of add_utr scripts

 

2.11 and above no longer have those scripts (UTR is always added by default now).  So this is a round about way to do it --> upgradeMAKER, then provide the merged GFF3 (made with gff3_merge) to genome_gff in the makeropts.ctl file, then set all pass options to 1.  Then make sure to clear all other options (repeatprotein model_org etc.).  When you rerun MAKER, it will just update the GFF3.

 

--Carson

 

 

 

From: Barry Moore <[hidden email]>
Date: Tue, 13 Sep 2011 18:41:16 -0600
To: "Khan, Anar" <[hidden email]>
Cc: Carson Holt <[hidden email]>, Dinatale C <[hidden email]>, "[hidden email]" <[hidden email]>
Subject: Re: [maker-devel] usage of add_utr scripts

 

Hi Anar,

 

I looks like the add_utr_to_gff3 script is completely hosed in that release.  I think it's just there as a relic and add_utr_start_stop_gff is the script that you want.  From those errors it looks like your gff3 file doesn't have the fasta sequence at the end of it. 

 

If I've guessed correctly, the quick workaround would be to just add lines like the following to the end of your GFF3 file - where ,of course, the fasta header and sequence are replaced with the correct data for the chromosome(s)/contig(s) represented in your GFF3 file. 

 

##FASTA

>chr1

ATCGATACGTAG...

 

B

 

 


Attention: The information contained in this message and/or attachments from AgResearch Limited is intended only for the persons or entities to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipients is prohibited by AgResearch Limited. If you have received this message in error, please notify the sender immediately.


 


_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
Reply | Threaded
Open this post in threaded view
|

Re: usage of add_utr scripts

Khan, Anar
<base href="x-msg://63/">

Hi

 

I’d like to try to add UTR coordinates predicted by the ab initio gene finders, given I can’t derive them from est alignments due to a lack of EST data. I see raw output is available in “the void” folder. Before writing a rather convoluted script which attempts to match snap/augustus/fgenesh predictions with maker predictions, does a script already exist; or can anyone offer suggestions/advice?

 

Cheers

Anar

 

From: [hidden email] [mailto:[hidden email]] On Behalf Of Khan, Anar
Sent: Thursday, 29 September 2011 2:28 p.m.
To: Carson Holt; Barry Moore
Cc: [hidden email]; Dinatale C
Subject: Re: [maker-devel] usage of add_utr scripts

 

Hi Carson

 

Thanks a lot for the explanation, all clear now.

 

Cheers

Anar

 

From: Carson Holt [mailto:[hidden email]]
Sent: Thursday, 29 September 2011 2:13 p.m.
To: Khan, Anar; Barry Moore
Cc: Dinatale C; [hidden email]
Subject: Re: [maker-devel] usage of add_utr scripts

 

ESTs that overlap the splice sites of the prediction are used to extend the gene model.  You can think of it as a kind of cut and paste operation.  Alt_est does not match the splice sites correctly (there are several technical reasons why).  Also since  alt_est is aligned in protein space (tblastx) the UTR region may not line up as it is non-coding, so it is not under the same evolutionary constraints as the rest of the alignment.

 

Thanks,

Carson

 

 

From: "Khan, Anar" <[hidden email]>
Date: Wed, 28 Sep 2011 19:07:53 -0600
To: Carson Holt <[hidden email]>, Barry Moore <[hidden email]>
Cc: Dinatale C <[hidden email]>, "[hidden email]" <[hidden email]>
Subject: RE: [maker-devel] usage of add_utr scripts

 

Hi

 

I apologise for asking another question about this script given it’s been deprecated, but I am curious about something and wonder whether you might be able to help. I ran add_utr_start_stop_gff on some maker output and no UTRs were added. For this species I have no est data, only altest data. Does this explain why no UTRs could be identified? What info is used to add UTR coordinates?

 

Thanks very much,

Anar

 

 

From: Carson Holt [[hidden email]]
Sent: Wednesday, 14 September 2011 2:02 p.m.
To: Barry Moore; Khan, Anar
Cc: Dinatale C; [hidden email]
Subject: Re: [maker-devel] usage of add_utr scripts

 

2.11 and above no longer have those scripts (UTR is always added by default now).  So this is a round about way to do it --> upgradeMAKER, then provide the merged GFF3 (made with gff3_merge) to genome_gff in the makeropts.ctl file, then set all pass options to 1.  Then make sure to clear all other options (repeatprotein model_org etc.).  When you rerun MAKER, it will just update the GFF3.

 

--Carson

 

 

 

From: Barry Moore <[hidden email]>
Date: Tue, 13 Sep 2011 18:41:16 -0600
To: "Khan, Anar" <[hidden email]>
Cc: Carson Holt <[hidden email]>, Dinatale C <[hidden email]>, "[hidden email]" <[hidden email]>
Subject: Re: [maker-devel] usage of add_utr scripts

 

Hi Anar,

 

I looks like the add_utr_to_gff3 script is completely hosed in that release.  I think it's just there as a relic and add_utr_start_stop_gff is the script that you want.  From those errors it looks like your gff3 file doesn't have the fasta sequence at the end of it. 

 

If I've guessed correctly, the quick workaround would be to just add lines like the following to the end of your GFF3 file - where ,of course, the fasta header and sequence are replaced with the correct data for the chromosome(s)/contig(s) represented in your GFF3 file. 

 

##FASTA

>chr1

ATCGATACGTAG...

 

B

 

 


Attention: The information contained in this message and/or attachments from AgResearch Limited is intended only for the persons or entities to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipients is prohibited by AgResearch Limited. If you have received this message in error, please notify the sender immediately.


 


_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
Reply | Threaded
Open this post in threaded view
|

Re: usage of add_utr scripts

Carson Hinton Holt
Well… It is more complicated than you may think.  SNAP doesn't predict UTR, fgenesh couldn't predict these last time I used it (but that was a few versions ago), and Augustus can but really only for a couple of model organisms (to get it to work on other organisms you have to have the coordinates  for a significant fraction of known UTRs throughout the genome).  UTR annotation is done downstream of the ab initio prediction in most cases.  So parsing the ab initio predictions will not give you these because they really aren't part of the output to begin with.  This is why MAKER uses ESTs to predict them.

--Carson

From: "Khan, Anar" <[hidden email]>
Date: Thu, 27 Oct 2011 15:53:53 -0600
To: Carson Holt <[hidden email]>, Barry Moore <[hidden email]>
Cc: "[hidden email]" <[hidden email]>, Dinatale C <[hidden email]>
Subject: RE: [maker-devel] usage of add_utr scripts

<base href="x-msg://63/">

Hi

 

I’d like to try to add UTR coordinates predicted by the ab initio gene finders, given I can’t derive them from est alignments due to a lack of EST data. I see raw output is available in “the void”folder. Before writing a rather convoluted script which attempts to match snap/augustus/fgenesh predictions with maker predictions, does a script already exist; or can anyone offer suggestions/advice?

 

Cheers

Anar

 

From: [hidden email] [[hidden email]] On Behalf Of Khan, Anar
Sent: Thursday, 29 September 2011 2:28 p.m.
To: Carson Holt; Barry Moore
Cc: [hidden email]; Dinatale C
Subject: Re: [maker-devel] usage of add_utr scripts

 

Hi Carson

 

Thanks a lot for the explanation, all clear now.

 

Cheers

Anar

 

From: Carson Holt [[hidden email]]
Sent: Thursday, 29 September 2011 2:13 p.m.
To: Khan, Anar; Barry Moore
Cc: Dinatale C; [hidden email]
Subject: Re: [maker-devel] usage of add_utr scripts

 

ESTs that overlap the splice sites of the prediction are used to extend the gene model.  You can think of it as a kind of cut and pasteoperation.  Alt_est does not match the splice sites correctly (there are several technical reasons why).  Also since  alt_est is aligned in protein space (tblastx) the UTR region may not line up as it is non-coding, so it is not under the same evolutionary constraints as the rest of the alignment.

 

Thanks,

Carson

 

 

From: "Khan, Anar" <[hidden email]>
Date: Wed, 28 Sep 2011 19:07:53 -0600
To: Carson Holt <[hidden email]>, Barry Moore <[hidden email]>
Cc: Dinatale C <[hidden email]>, "[hidden email]" <[hidden email]>
Subject: RE: [maker-devel] usage of add_utr scripts

 

Hi

 

I apologise for asking another question about this script given it’s been deprecated, but I am curious about something and wonder whether you might be able to help. I ran add_utr_start_stop_gff on some maker output and no UTRs were added. For this species I have no est data, only altest data. Does this explain why no UTRs could be identified? What info is used to add UTR coordinates?

 

Thanks very much,

Anar

 

 

From: Carson Holt [[hidden email]]
Sent: Wednesday, 14 September 2011 2:02 p.m.
To: Barry Moore; Khan, Anar
Cc: Dinatale C; [hidden email]
Subject: Re: [maker-devel] usage of add_utr scripts

 

2.11 and above no longer have those scripts (UTR is always added by default now).  So this is a round about way to do it --> upgradeMAKER, then provide the merged GFF3 (made with gff3_merge) to genome_gff in the makeropts.ctl file, then set all pass options to 1.  Then make sure to clear all other options (repeatprotein model_org etc.).  When you rerun MAKER, it will just update the GFF3.

 

--Carson

 

 

 

From: Barry Moore <[hidden email]>
Date: Tue, 13 Sep 2011 18:41:16 -0600
To: "Khan, Anar" <[hidden email]>
Cc: Carson Holt <[hidden email]>, Dinatale C <[hidden email]>, "[hidden email]" <[hidden email]>
Subject: Re: [maker-devel] usage of add_utr scripts

 

Hi Anar,

 

I looks like the add_utr_to_gff3 script is completely hosed in that release.  I think it's just there as a relic and add_utr_start_stop_gff is the script that you want.  From those errors it looks like your gff3 file doesn't have the fasta sequence at the end of it. 

 

If I've guessed correctly, the quick workaround would be to just add lines like the following to the end of your GFF3 file - where ,of course, the fasta header and sequence are replaced with the correct data for the chromosome(s)/contig(s) represented in your GFF3 file. 

 

##FASTA

>chr1

ATCGATACGTAG...

 

B

 

 


Attention: The information contained in this message and/or attachments from AgResearch Limited is intended only for the persons or entities to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipients is prohibited by AgResearch Limited. If you have received this message in error, please notify the sender immediately.


 


_______________________________________________
maker-devel mailing list
[hidden email]
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org