A Novel Stemming Algorithm for Albanian: a Data Mining Approach for Document Classification in Albanian - Jetmir Sadiku - Books - LAP LAMBERT Academic Publishing - 9783659194467 - 24 July 2012

A Novel Stemming Algorithm for Albanian: a Data Mining Approach for Document Classification in Albanian

Jetmir Sadiku

Price
S$ 60.50

Made-to-order item

Expected delivery 16 - 24 July

This book deals with the design and construction of a stemming algorithm for the Albanian language and its subsequent use in classifying a corpus of documents. The work is based on research into stemming algorithms for other languages and on the morphology of Albanian. Text mining is a knowledge-intensive technique for interacting with a collection of documents through a set of analysis tools. Data and text mining (the data can be text) is becoming an increasingly useful way of extracting information from stored data. The fields where data mining helps most include medicine, banking, finance, marketing, and spam filtering. A stemming algorithm is a procedure that removes suffixes from words, yielding the root (stem) of each word. Search engines need stemming to conflate words that share a stem, which reduces the number of index entries. This book presents a first set of rules for Albanian to be used in a stemming algorithm and, for the first time, a list of Albanian stopwords.
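
To make the idea of suffix stripping and stopword removal concrete, here is a minimal Python sketch. It is not the rule set or the stopword list from the book: the three suffixes, the four stopwords, the minimum stem length, and the function names `stem` and `index_terms` are all assumptions chosen only for illustration.

```python
# Illustrative only: these suffixes and stopwords are assumptions,
# not the rules or the stopword list presented in the book.
SUFFIXES = ("ave", "at", "a")
STOPWORDS = {"dhe", "në", "për", "me"}
MIN_STEM_LENGTH = 3  # hypothetical guard against over-stripping short words


def stem(word: str) -> str:
    """Strip the longest matching suffix, keeping at least MIN_STEM_LENGTH characters."""
    for suffix in sorted(SUFFIXES, key=len, reverse=True):
        if word.endswith(suffix) and len(word) - len(suffix) >= MIN_STEM_LENGTH:
            return word[: -len(suffix)]
    return word


def index_terms(text: str) -> set[str]:
    """Lowercase the text, drop stopwords, and stem the remaining words."""
    words = text.lower().split()
    return {stem(w) for w in words if w not in STOPWORDS}
```

With these assumed rules, the inflected forms "shkolla", "shkollat", and "shkollave" all reduce to the single index term "shkoll", which is the kind of index reduction the description refers to.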

Media Books     Paperback   (Book with soft cover and glued spine)
Published 24 July 2012
ISBN13 9783659194467
Publisher LAP LAMBERT Academic Publishing
Number of pages 104
Dimensions 150 × 6 × 226 mm   ·   173 g
Language German