Ba-Alwi, Fadl Mutaher and Gaphari, Ghaleb H. and Al-Duqaimi, Fares Nasser (2015) Arabic Text Summarization Using Latent Semantic Analysis. British Journal of Applied Science & Technology, 10 (2). pp. 1-14. ISSN 22310843
Gaphari1022015BJAST17678.pdf - Published Version
Download (463kB)
Abstract
The main objective of this paper is to address Arabic text summarization using latent semantic analysis technique. LSA is a vectorial semantic form of analyzing relationships between a set of sentences. It is concerned with the word description as well as the sentence description for each concept or topic. LSA creates the word by sentence semantic matrix of a document or documents. Each word in the matrix row is represented by word variations such as root, stem and original word. The root is empirically specified as the most effective word representative, where F-score of 63% is obtained at the same time an average ROUGE of 48.5% is obtained too. LSA is implemented along with root representative and different weighting techniques then the optimal combination is specified and used as a proposed summarizer for Arabic Text Summarization. Then the summarizer is implemented again, where the input documents are pre-processed by POS tagger. The summarizer performance and effectiveness are measured manually and automatically based on the summarization accuracy. Experimental results show that the summarizer obtains higher level of accuracy as compared to human summarizer. When the compression rate is 25% F-scores of 68% is obtained and an average ROUGE score of 59% is obtained as well, in terms of Arabic text summarization.
Item Type: | Article |
---|---|
Subjects: | Archive Paper Guardians > Multidisciplinary |
Depositing User: | Unnamed user with email support@archive.paperguardians.com |
Date Deposited: | 27 Jan 2024 04:15 |
Last Modified: | 27 Jan 2024 04:15 |
URI: | http://archives.articleproms.com/id/eprint/1194 |