Irandoc Corpora Portal

Corpus Features

Very Specialized Writings

The Irandoc corpus contains nearly 4.78 million words. It is not a general-language corpus: it consists of highly specialized and interdisciplinary writings in fields such as librarianship and information, information technology, knowledge management, information science and epistemology, computational linguistics, and terminology.

Effective Search

Search results show the search word or phrase in its linguistic context, together with the title of the article in which it occurs, the subject of the article, the author(s) of the article, and the frequency of the search word or phrase.
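The kind of result described above resembles a keyword-in-context (KWIC) concordance. The following is only a minimal sketch of that idea, not the portal's actual implementation: the `Article` record, its field names, the `search` function, and the `width` parameter are all hypothetical, and tokenization is simplified to whitespace splitting with case-insensitive matching.

```python
from dataclasses import dataclass


@dataclass
class Article:
    # Hypothetical record; the field names are assumptions, not the portal's schema.
    title: str
    subject: str
    authors: list[str]
    text: str


def search(articles, query, width=3):
    """Return (frequency, hits): the number of occurrences of the query
    and one keyword-in-context entry per occurrence."""
    q = query.lower().split()
    if not q:
        return 0, []
    hits = []
    for art in articles:
        tokens = art.text.split()  # simplification: whitespace tokenization
        lowered = [t.lower() for t in tokens]
        for i in range(len(tokens) - len(q) + 1):
            if lowered[i:i + len(q)] == q:
                end = i + len(q)
                hits.append({
                    "left": " ".join(tokens[max(0, i - width):i]),
                    "match": " ".join(tokens[i:end]),
                    "right": " ".join(tokens[end:end + width]),
                    "title": art.title,
                    "subject": art.subject,
                    "authors": art.authors,
                })
    return len(hits), hits
```

Each hit carries the surrounding words plus the article metadata, and the frequency is simply the number of hits. A real system would need proper tokenization and punctuation handling.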

Comprehensive Tags

The corpora are annotated with part-of-speech (POS) tags, which are used in language processing. These tags specify the category of each word (noun, adjective, adverb, and so on).
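As an illustration of how POS-tagged text can be used in processing, the sketch below filters words by tag and counts tags. The sample sentence, the tag names (`NOUN`, `VERB`, `ADV`, `ADJ`), and both helper functions are hypothetical; the corpus's actual tagset and data format are not stated here.

```python
from collections import Counter

# Hypothetical tagged sentence; the tag names are illustrative,
# not the corpus's actual tagset.
tagged = [
    ("information", "NOUN"),
    ("retrieval", "NOUN"),
    ("is", "VERB"),
    ("very", "ADV"),
    ("useful", "ADJ"),
]


def words_with_tag(tagged_tokens, tag):
    """Return the words carrying the given POS tag, in order."""
    return [word for word, t in tagged_tokens if t == tag]


def tag_counts(tagged_tokens):
    """Count how often each POS tag occurs."""
    return Counter(t for _, t in tagged_tokens)
```

For example, `words_with_tag(tagged, "NOUN")` selects the nouns in the sample sentence, and `tag_counts(tagged)` gives the tag distribution.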
