We have hosted the application arabic corpus in order to run this application in our online workstations with Wine or directly.


Quick description about arabic corpus:

The Arabic Corpus {compiled by Dr. Mourad Abbas ( http: //sites.google.com/site/mouradabbas9/corpora ) The corpus Khaleej-2004 contains 5690 documents. It is divided to 4 topics (categories). The corpus Watan-2004 contains 20291 documents organized in 6 topics (categories). Researchers who use these two corpora would mention the two main references:
(1) For Watan-2004 corpus
----------------------
M. Abbas, K. Smaili, D. Berkani, (2011) Evaluation of Topic Identification Methods on Arabic Corpora,JOURNAL OF DIGITAL INFORMATION MANAGEMENT,vol. 9, N. 5, pp.185-192.

2) For Khaleej-2004 corpus
---------------------------------
M. Abbas, K. Smaili (2005) Comparison of Topic Identification Methods for Arabic Language, RANLP05 : Recent Advances in Natural Language Processing ,pp. 14-17, 21-23 september 2005, Borovets, Bulgary.

More useful references to check:
-------------------------------------------
https: //sites.google.com/site/mouradabbas9/corpora.

Audience: Information Technology, Science/Research, Advanced End Users, Developers, Quality Engineers, Engineering.
User interface: Win32 (MS Windows), KDE.
Programming Language: Python, C++, JavaScript.
Database Environment: MySQL.

Categories:
Machine Translation, Machine Learning

Page navigation:

©2024. Winfy. All Rights Reserved.

By OD Group OU – Registry code: 1609791 -VAT number: EE102345621.