Ai Feedback
Exact(7)
A threshold point at which the trend in the statistics for the total number and distribution of coevolving pairs appeared to change could be recognized in the MSA with 86% maximal sequence identity, which consisted of only 165 sequences (Figure S2, MSA S3, Supporting Information).
Maximal sequence identity to any BTV ranged from 63% (segment 2) to 79% (segments 7 and 10).
From the histogram shown in Figure 3, we can observe that the median value of the maximal sequence identity to training enzymes is only 25%.
To reduce this second source of bias, we evaluate the performance of EFICAz according to different levels of maximal sequence identity of the test sequences to the training enzymes.
Achieving this level of precision is not trivial, considering that: (i) hypothetical proteins are the most difficult targets for automated function prediction [ 77], and (ii) the maximal sequence identity between the 64 hypothetical proteins and the EFICAz training enzymes has a median value of 25%.
To exclude cases in which the transfer of functional annotation could be successfully achieved in most cases by simple sequence similarity based methods, we only consider hypothetical proteins whose maximal sequence identity to any of the enzymes we used to train EFICAz is less than 60%.
Similar(53)
For alignments longer than 250 residues, HVAL < 0 implies that the maximal pairwise sequence identity was 20% (Rost, 1999).
Within the best 100 matches, the maximal DNA sequence identity was in the range of 100 - 96%.
Redundancy was removed with cd-hit at a level of maximal 90% sequence identity [ 20], the remaining sequences were aligned with MAFFT (using L-INS-I settings [ 21]) and the resulting multiple alignment was visualized and annotated in Jalview [ 22].
The second set (dubbed "hval0") contained 3,767 chains; it resulted from filtering at HVAL>0 [ 2, 3, 13] (corresponding to ~20% maximal pairwise sequence identity for alignments over 250 residues).
The average recall of EFICAz depends of the specific maximal testing to training sequence identity interval.
Related(20)
upper sequence identity
high sequence identity
top sequence identity
possible sequence identity
full sequence identity
maximizing sequence identity
greatest sequence identity
best sequence identity
highest sequence identity
maximal sequence length
maximal sequence colinearity
maximal sequence diversity
maximal sequence similarity
maximal sequence divergence
maximal percent identity
maximal sequence alignment
maximal sequence discrimination
maximal sequence information
maximal sequence variation
maximum sequence identity
Write better and faster with AI suggestions while staying true to your unique style.
Since I tried Ludwig back in 2017, I have been constantly using it in both editing and translation. Ever since, I suggest it to my translators at ProSciEditing.
Justyna Jupowicz-Kozak
CEO of Professional Science Editing for Scientists @ prosciediting.com