Results (2)
Search Parameters:
Keyword: Lemmatization
Sentence Retrieval using Stemming and Lemmatization with Different Length of the Queries
Advances in Science, Technology and Engineering Systems Journal,
Volume 5,
Issue 3,
Page # 349–354,
2020;
DOI: 10.25046/aj050345
Abstract:
In this paper we focus on sentence retrieval, which is similar to document retrieval but with a smaller unit of retrieval. Using data pre-processing in document retrieval is generally considered useful. When it comes to sentence retrieval, the situation is not that clear. In this paper we use TF-ISF (term frequency – inverse sentence frequency)…
(This article belongs to Section Interdisciplinary Applications of Computer Science (CSI))
Four-Dimensional Sparse Data Structures for Representing Text Data
by Martin Marinov and Alexander Efremov
Advances in Science, Technology and Engineering Systems Journal,
Volume 5,
Issue 5,
Page # 154–166,
2020;
DOI: 10.25046/aj050521
Abstract:
This paper focuses on a string encoding algorithm that produces sparse distributed representations of text data. A characteristic feature of the algorithm described here is that it works without tokenizing the text and can avoid other data preparation steps, such as stemming and lemmatization. The text can be of arbitrary size, whether it is a…
(This article belongs to the SP9 (Special Issue on Multidisciplinary Innovation in Engineering Science & Technology 2020) & Section Interdisciplinary Applications of Computer Science (CSI))
