Please fill in your query. A complete syntax description you will find on the General Help page.
Algorithm for grounding mutation mentions from text to protein sequences. (English)
Lambrix, Patrick (ed.) et al., Data integration in the life sciences. 7th international conference, DILS 2010, Gothenburg, Sweden, August 25‒27, 2010. Proceedings. Berlin: Springer (ISBN 978-3-642-15119-4/pbk). Lecture Notes in Computer Science 6254. Lecture Notes in Bioinformatics, 122-131 (2010).
Summary: Protein mutations derived from in vitro experimental analysis are described in detail in scientific papers. Reuse of mutation impact annotations is an important subfield of bioinformatics for which mutation grounding is a critical step. Presented here is a method for grounding of textual mentions from papers describing mutational changes to proteins. We distinguish between grounding of mutation entities to protein database identifiers and to the correct positions on sequences extracted from protein databases. The grounding workflow coordinates the extraction of mutation, protein and organism mentions from texts and uses these to identify target sequences. Mutation mentions are sequentially mapped onto candidate proteins to facilitate their correct grounding to a protein sequence, independent of a protein-mutation tuple extraction task. Using a gold standard corpus of full text articles and corresponding protein sequences we show high performance precision and recall and discuss novel aspects of the algorithm in the context of previous work.
WorldCat.org
Valid XHTML 1.0 Transitional Valid CSS!