[biomart-users] Understand the meaning of percentage_gc_content
Somehow my previous message was not sent. I redrafted the same message I sent before.
I'm trying to understand the meaning of the percentage_gc_content generated by biomaRt. I found that it was quite strange that all transcripts of a gene gave the same percentage_gc_content. For example, for the two transcripts (ENSMUST00000187148,ENSMUST00000115891) of gene ENSMUSG00000000103, the percentage_gc_content is exactly the same 36.56 (see the R code below).
I did the computation manually and the the percentage_gc_content for the two transcripts (ENSMUST00000187148,ENSMUST00000115891) is respectively 39.46 and 40.20 (see the R code below). These two numbers are also confirmed from another source that is independent of the R code below
1. So my question is, what is the percentage_gc_content that is generated by biomaRt?
2. While I was exploring the BM function of the biomaRt package, there is a bug if we wanted to use the attributes "cdna" or "gene_exon", which will shift the columns names, see a print out of the variable seq in the following R code.
---------------------------------------- The following is the R code