Hide metadata

dc.contributor.authorAdzhubei, Alexei A
dc.contributor.authorVlasova, Anna V
dc.contributor.authorHagen-Larsen, Heidi
dc.contributor.authorRuden, Torgeir A
dc.contributor.authorLaerdahl, Jon K
dc.contributor.authorHøyheim, Bjørn
dc.date.accessioned2015-10-09T02:08:58Z
dc.date.available2015-10-09T02:08:58Z
dc.date.issued2007
dc.identifier.citationBMC Genomics. 2007 Jul 02;8(1):209
dc.identifier.urihttp://hdl.handle.net/10852/46600
dc.description.abstractBackground To identify as many different transcripts/genes in the Atlantic salmon genome as possible, it is crucial to acquire good cDNA libraries from different tissues and developmental stages, their relevant sequences (ESTs or full length sequences) and attempt to predict function. Such libraries allow identification of a large number of different transcripts and can provide valuable information on genes expressed in a particular tissue at a specific developmental stage. This data is important in constructing a microarray chip, identifying SNPs in coding regions, and for future identification of genes in the whole genome sequence. An important factor that determines the usefulness of generated data for biologists is efficient data access. Public searchable databases play a crucial role in providing such service. Description Twenty-three Atlantic salmon cDNA libraries were constructed from 15 tissues, yielding nearly 155,000 clones. From these libraries 58,109 ESTs were generated, of which 57,212 were used for contig assembly. Following deletion of mitochondrial sequences 55,118 EST sequences were submitted to GenBank. In all, 20,019 unique sequences, consisting of 6,424 contigs and 13,595 singlets, were generated. The Norwegian Salmon Genome Project Database has been constructed and annotation performed by the annotation transfer approach. Annotation was successful for 50.3% (10,075) of the sequences and 6,113 sequences (30.5%) were annotated with Gene Ontology terms for molecular function, biological process and cellular component. Conclusion We describe the construction of cDNA libraries from juvenile/pre-smolt Atlantic salmon (Salmo salar), EST sequencing, clustering, and annotation by assigning putative function to the transcripts. These sequences represents 97% of all sequences submitted to GenBank from the pre-smoltification stage. The data has been grouped into datasets according to its source and type of annotation. Various data query options are offered including searches on function assignments and Gene Ontology terms. Data delivery options include summaries for the datasets and their annotations, detailed self-explanatory annotations, and access to the original BLAST results and Gene Ontology annotation trees. Potential presence of a relatively high number of immune-related genes in the dataset was shown by annotation searches.
dc.language.isoeng
dc.rightsAdzhubei et al.
dc.rightsAttribution 2.0 Generic
dc.rights.urihttp://creativecommons.org/licenses/by/2.0/
dc.titleAnnotated Expressed Sequence Tags (ESTs) from pre-smolt Atlantic salmon (Salmo salar) in a searchable data resource
dc.typeJournal article
dc.date.updated2015-10-09T02:08:58Z
dc.creator.authorAdzhubei, Alexei A
dc.creator.authorVlasova, Anna V
dc.creator.authorHagen-Larsen, Heidi
dc.creator.authorRuden, Torgeir A
dc.creator.authorLaerdahl, Jon K
dc.creator.authorHøyheim, Bjørn
dc.identifier.doihttp://dx.doi.org/10.1186/1471-2164-8-209
dc.identifier.urnURN:NBN:no-50786
dc.type.documentTidsskriftartikkel
dc.type.peerreviewedPeer reviewed
dc.identifier.fulltextFulltext https://www.duo.uio.no/bitstream/handle/10852/46600/1/12864_2007_Article_922.pdf
dc.type.versionPublishedVersion
cristin.articleid209


Files in this item

Appears in the following Collection

Hide metadata

Attribution 2.0 Generic
This item's license is: Attribution 2.0 Generic