TY  - JOUR
T1  - Information retrieval on Turkish texts
A1  - Can, F.
A1  - Kocberber, S.
A1  - Balcik, E.
A1  - Kaynak, C.
A1  - Ocalan, H.C.
A1  - Vursavas, O.M.
JA  - J of American Society for Information Science & Technology
Y1  - 2008
VL  - 59
SP  - 407
EP  - 421
N1  - 29.11.2011
M2  - doi: 10.1002/asi.20750
N2  - In this study, we investigate information retrieval (IR) on
Turkish texts using a large-scale test collection that contains
408,305 documents and 72 ad hoc queries. We
examine the effects of several stemming options and
query-document matching functions on retrieval performance.
We show that a simple word truncation approach, a
word truncation approach that uses language-dependent
corpus statistics, and an elaborate lemmatizer-based stemmer
provide similar retrieval effectiveness in Turkish IR. We
investigate the effects of a range of search conditions on
the retrieval performance; these include scalability issues,
query and document length effects, and the use of stopword
list in indexing.
ER  -