<HashMap><database>biostudies-literature</database><scores/><additional><omics_type>Unknown</omics_type><volume>36(10)</volume><submitter>Perovic V</submitter><pubmed_abstract>&lt;h4>Motivation&lt;/h4>Proteins containing tandem repeats (TRs) are abundant, frequently fold in elongated non-globular structures and perform vital functions. A number of computational tools have been developed to detect TRs in protein sequences. A blurred boundary between imperfect TR motifs and non-repetitive sequences gave rise to necessity to validate the detected TRs.&lt;h4>Results&lt;/h4>Tally-2.0 is a scoring tool based on a machine learning (ML) approach, which allows to validate the results of TR detection. It was upgraded by using improved training datasets and additional ML features. Tally-2.0 performs at a level of 93% sensitivity, 83% specificity and an area under the receiver operating characteristic curve of 95%.&lt;h4>Availability and implementation&lt;/h4>Tally-2.0 is available, as a web tool and as a standalone application published under Apache License 2.0, on the URL https://bioinfo.crbm.cnrs.fr/index.php? route=tools&amp;tool=27. It is supported on Linux. Source code is available upon request.&lt;h4>Supplementary information&lt;/h4>Supplementary data are available at Bioinformatics online.</pubmed_abstract><journal>Bioinformatics (Oxford, England)</journal><pagination>3260-3262</pagination><full_dataset_link>https://www.ebi.ac.uk/biostudies/studies/S-EPMC7214015</full_dataset_link><repository>biostudies-literature</repository><pubmed_title>Tally-2.0: upgraded validator of tandem repeat detection in protein sequences.</pubmed_title><pmcid>PMC7214015</pmcid><pubmed_authors>Veljkovic N</pubmed_authors><pubmed_authors>Sumonja N</pubmed_authors><pubmed_authors>Richard FD</pubmed_authors><pubmed_authors>Kajava AV</pubmed_authors><pubmed_authors>Leclercq JY</pubmed_authors><pubmed_authors>Perovic V</pubmed_authors></additional><is_claimable>false</is_claimable><name>Tally-2.0: upgraded validator of tandem repeat detection in protein sequences.</name><description>&lt;h4>Motivation&lt;/h4>Proteins containing tandem repeats (TRs) are abundant, frequently fold in elongated non-globular structures and perform vital functions. A number of computational tools have been developed to detect TRs in protein sequences. A blurred boundary between imperfect TR motifs and non-repetitive sequences gave rise to necessity to validate the detected TRs.&lt;h4>Results&lt;/h4>Tally-2.0 is a scoring tool based on a machine learning (ML) approach, which allows to validate the results of TR detection. It was upgraded by using improved training datasets and additional ML features. Tally-2.0 performs at a level of 93% sensitivity, 83% specificity and an area under the receiver operating characteristic curve of 95%.&lt;h4>Availability and implementation&lt;/h4>Tally-2.0 is available, as a web tool and as a standalone application published under Apache License 2.0, on the URL https://bioinfo.crbm.cnrs.fr/index.php? route=tools&amp;tool=27. It is supported on Linux. Source code is available upon request.&lt;h4>Supplementary information&lt;/h4>Supplementary data are available at Bioinformatics online.</description><dates><release>2020-01-01T00:00:00Z</release><publication>2020 May</publication><modification>2022-02-09T18:05:03.851Z</modification><creation>2022-02-09T18:05:03.851Z</creation></dates><accession>S-EPMC7214015</accession><cross_references><pubmed>32096820</pubmed><doi>10.1093/bioinformatics/btaa121</doi></cross_references></HashMap>