Suggestions(1)
Exact(1)
Totally, 630,618 transcripts distributed in 116,653 trinity components (multiple alternatively spliced transcripts from a gene locus) were obtained with average length of 1,454 bp and N50 length of 2,100 bp, and the longest transcript of each trinity component was selected as representative for the construction of wheat unigenes dataset (Additional file 2).
Similar(59)
To avoid this redundancy in subsequent sequence and expression analyses, a reference transcriptome assembly was created by selecting one representative transcript for each Trinity subcomponent.
This is to be expected due to the clustering of multiple transcripts into each Trinity component.
We calculated a score for each of the remaining 53,071 putative transcripts of the Trinity assembly based on (a) similarity to known sequences, (b) features of any predicted ORFs, and (c) sequencing coverage along the putative transcript.
Transcripts of the same trinity component could be treated as different sequence isoforms.
Finally, we estimated the abundance of each transcript using the Trinity program align_and_estimate_abundance.pl, which uses Bowtie v1.0.1 to align reads (Grabherr et al. 2011).
Similarly, only keeping the highest covered transcripts for each subcomponent (Trinity pickH) further reduced reference coverage, redundancy, and chimera rate.
Putative orthologs between the mouse and Trinity assembled transcriptomes (each Trinity transcriptome contains a set of Trinity Transcripts) were identified by a bidirectional best-hit method by discontiguous MegaBLAST.
The Trinity assembler clusters all transcripts and alternatively-spliced variants of the same gene into components, therefore expression levels were also estimated for every gene in the transcriptome by estimating the abundance of all of the sequencing reads that mapped to all of the transcripts within a Trinity component in the assembly.
To further refine the candidate transcripts, we extracted the raw reads that mapped to those 184 loci, processed a de novo transcript assembly through Trinity (Grabherr et al., 2011) and determined the Coding Potential Calculator (CPC) score of each transcript (Kong et al., 2007) to identify 64 novel de novo lncRNAs.
① All of the transcripts from Trinity were aligned to GASS's transcript-sequences with BLAST.
Write better and faster with AI suggestions while staying true to your unique style.
Since I tried Ludwig back in 2017, I have been constantly using it in both editing and translation. Ever since, I suggest it to my translators at ProSciEditing.

Justyna Jupowicz-Kozak
CEO of Professional Science Editing for Scientists @ prosciediting.com