TY - JOUR T1 - Information retrieval on Turkish texts A1 - Can, F. A1 - Kocberber, S. A1 - Balcik, E. A1 - Kaynak, C. A1 - Ocalan, H.C. A1 - Vursavas, O.M. JA - J of American Society for Information Science & Technology Y1 - 2008 VL - 59 SP - 407 EP - 421 N1 - 29.11.2011 M2 - doi: 10.1002/asi.20750 N2 - In this study, we investigate information retrieval (IR) on Turkish texts using a large-scale test collection that contains 408,305 documents and 72 ad hoc queries. We examine the effects of several stemming options and query-document matching functions on retrieval performance. We show that a simple word truncation approach, a word truncation approach that uses language-dependent corpus statistics, and an elaborate lemmatizer-based stemmer provide similar retrieval effectiveness in Turkish IR. We investigate the effects of a range of search conditions on the retrieval performance; these include scalability issues, query and document length effects, and the use of stopword list in indexing. ER -