Using machine learning and text mining in question answering. (English)
Peters, Carol (ed.) et al., Evaluation of multilingual and multi-modal information retrieval. 7th workshop of the cross-language evaluation forum, CLEF 2006, Alicante, Spain, September 20‒22, 2006. Revised selected papers. Berlin: Springer (ISBN 978-3-540-74998-1/pbk). Lecture Notes in Computer Science 4730, 415-423 (2007).
Summary: This paper describes a QA system centered on a fully data-driven architecture. It applies machine learning and text mining techniques to identify the most probable answers to factoid and definition questions, respectively. Its main strength is that it relies mainly on lexical information and avoids complex language processing resources such as named entity classifiers, parsers, and ontologies. Experimental results on the Spanish Question Answering task at CLEF 2006 show that the proposed architecture can be a practical solution for monolingual question answering, reaching a precision as high as 51%.