An Innovative Automatic Indexing Method For Arabic Text

Ramzi A. Haraty; Sanaa  Kaddoura; Sultan  Al Jahdali; Nour K.  Masri

download PDF

Published:

Nov 21, 2023

Keywords:

Arabic Text, Automatic Indexing, Building Thesaurus, Frequent Sets, Synonyms

Issue

Vol. 3 No. 1 (2023)

Section

Articles

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

ACE journal provides immediate open access to all content on the principle that making research freely available to the public supports a greater global exchange of knowledge. ACE grants usage rights to others using the open license CC-BY-NC allowing for immediate free access to the work and permitting any user to read, download, copy, distribute, print, search, or link to the full texts of articles, crawl them for indexing, pass them as data to software, or use them for any other lawful purpose.

Ramzi A. Haraty

Sanaa Kaddoura

Sultan Al Jahdali

Nour K. Masri

Abstract

The study of automatic indexing and text retrieval methods for language has a long history. Automatic indexing involves extracting words from a document to categorize it based on subject matter and to improve the information retrieval process. Despite extensive research in other languages, there remains limited investigation into automated Arabic text categorization. In this research, the researchers introduce an innovative method to enhance the accuracy of automatic indexing of Arabic texts by incorporating a thesaurus. Their approach extracts new relevant words by referencing thesaurus, which contains words, synonyms, and correlations identified through its construction using a natural language toolkit and a WordNet library. Synonyms with similar meanings that frequently appear together are grouped using a JavaScript Object Notation dictionary. The research results demonstrate a significant improvement in accuracy and efficiency compared to prior studies.

Advances in Computing and Engineering
Journal / Advances in Computing and Engineering / Vol. 3 No. 1 (2023) / Articles

Published:

Keywords:

An Innovative Automatic Indexing Method For Arabic Text

Ramzi A. Haraty

Sanaa Kaddoura

Sultan Al Jahdali

Nour K. Masri

Abstract

Journal Identifiers

Article Sidebar

Published:

Keywords:

Article Details

Main Article Content

Ramzi A. Haraty

Sanaa Kaddoura

Sultan Al Jahdali

Nour K. Masri

Abstract

Journal Identifiers