Train database usage

Hi, 

I noticed you are using a combination of database including rnacentral, rfam, ensembl and nt. 

Can I please ask why did you chose these databases? 

Specifically, rnacentral should be a superset of rfam and ensembl. While nt is not a part of rnacentral, it should have been very similar to the ENA database, which is also a subset of rnacentral. 

Besides, what data deduplication pipelines is applied to remove the redundancy? 


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Train database usage #10

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Train database usage #10

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions