arabic corpus online with Winfy

We have hosted the application arabic corpus in order to run this application in our online workstations with Wine or directly.


Quick description about arabic corpus:

The Arabic Corpus {compiled by Dr. Mourad Abbas ( http: //sites.google.com/site/mouradabbas9/corpora ) The corpus Khaleej-2004 contains 5690 documents. It is divided to 4 topics (categories). The corpus Watan-2004 contains 20291 documents organized in 6 topics (categories). Researchers who use these two corpora would mention the two main references:
(1) For Watan-2004 corpus
----------------------
M. Abbas, K. Smaili, D. Berkani, (2011) Evaluation of Topic Identification Methods on Arabic Corpora,JOURNAL OF DIGITAL INFORMATION MANAGEMENT,vol. 9, N. 5, pp.185-192.

2) For Khaleej-2004 corpus
---------------------------------
M. Abbas, K. Smaili (2005) Comparison of Topic Identification Methods for Arabic Language, RANLP05 : Recent Advances in Natural Language Processing ,pp. 14-17, 21-23 september 2005, Borovets, Bulgary.

More useful references to check:
-------------------------------------------
https: //sites.google.com/site/mouradabbas9/corpora.

Audience: Information Technology, Science/Research, Advanced End Users, Developers, Quality Engineers, Engineering.
User interface: Win32 (MS Windows), KDE.
Programming Language: Python, C++, JavaScript.
Database Environment: MySQL.

Categories:
Machine Translation, Machine Learning

©2024. Winfy. All Rights Reserved.

By OD Group OU – Registry code: 1609791 -VAT number: EE102345621.