id: 05623293 dt: a an: 05623293 au: Lalitha Devi, Sobha; Kuppan, Sankar; Venkataswamy, Kavitha; Rao, Pattabhi R.K. ti: Identification of similar documents using coherent chunks. so: Lalitha Devi, Sobha (ed.) et al., Anaphora processing and applications. 7th discourse anaphora and anaphor resolution colloquium, DAARC 2009, Goa, India, November 5‒6, 2009. Proceedings. Berlin: Springer (ISBN 978-3-642-04974-3/pbk). Lecture Notes in Computer Science 5847. Lecture Notes in Artificial Intelligence, 54-68 (2009). py: 2009 pu: Berlin: Springer la: EN cc: ut: Coherence; Document similarity; Coreference ci: li: doi:10.1007/978-3-642-04975-0_5 ab: Summary: We focus on automatically finding similar documents using coherent chunks. The similarity between the documents is determined by identifying the coherent chunks present in them. We apply linguistic rules to identify the coherent chunks and use the Vector Space Model (VSM) to determine the similarity among documents. We have taken patent documents from USPTO for this work. This method of using coherent chunks for identifying similar documents has shown encouraging results. rv: