Developing Lexico-Semantic Relations of Saraiki Nouns: A Corpus-Based Study

Authors

  • Musarat Nazeer, M.Phil. Scholar, Department of English, University of Sargodha, Sargodha, Punjab, Pakistan
  • Musarrat Azher, Associate Professor, Department of Linguistics and Language Studies, University of Sargodha, Pakistan
  • Azhar Pervaiz, Assistant Professor, Department of Linguistics and Language Studies, University of Sargodha, Pakistan
  • Iqra Yasmeen, M.Phil. Scholar, Department of English, University of Sargodha, Sargodha, Pakistan

DOI:

https://doi.org/10.33195/

Keywords:

Corpus-based Study, Saraiki Nouns, Lexico-semantic Relations, WordNet, NLP

Abstract

Saraiki, the fourth most widely spoken language in Pakistan and also used in parts of India and Afghanistan, is of significant geographical, historical, and cultural importance. However, it remains under-documented, and its distinctive linguistic features have not been systematically identified. The current study identifies the lexico-semantic categories of Saraiki nouns and develops their hierarchical relationships (Miller et al., 1993). This quantitative research is designed to contribute to the development of a Saraiki WordNet (SWN) within the field of Natural Language Processing (NLP). A corpus of 3 million words was compiled from different genres of the Saraiki language, including newspapers, academic essays, literary texts, and religious books. Both the expansion and merge approaches were used to analyze the data. A wordlist of the 1,500 most frequent nouns was extracted from the corpus using AntConc 3.4.4.0, followed by manual tagging in Microsoft Excel 2010. As a result, the 39 most frequent nouns from the wordlist were used to develop 173 related synsets, and the lexico-semantic relationships among these nouns were identified with the help of 30 hierarchies (Miller et al., 1993). The study is limited to areas such as Bahawalpur, Multan, and Muzaffarabad. It is expected to serve as a milestone for Saraiki language learners, SWN development, Saraiki lexical resources, and online SL dictionaries, and as a guide for researchers.
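
The two steps summarized in the abstract, ranking words by frequency and linking nouns into hierarchies, can be illustrated with a minimal Python sketch. This is not the study's procedure, which relied on AntConc and manual tagging in Excel. The function names `top_words` and `hypernym_chain`, the `HYPERNYM` table, and the English placeholder words are all assumptions made for illustration and are not data from the corpus.

```python
from collections import Counter


def top_words(tokens, n):
    """Return the n most frequent tokens with their counts."""
    return Counter(tokens).most_common(n)


# Hypothetical child -> parent (hypernym) links.
# English placeholders only; not taken from the Saraiki corpus.
HYPERNYM = {
    "mango": "fruit",
    "fruit": "food",
    "food": "entity",
}


def hypernym_chain(word, links):
    """Follow hypernym links upward from word until no parent is found."""
    chain = [word]
    seen = {word}
    while chain[-1] in links:
        parent = links[chain[-1]]
        if parent in seen:  # guard against accidental cycles in the table
            break
        chain.append(parent)
        seen.add(parent)
    return chain


# Toy token list standing in for a tokenized corpus.
tokens = "mango fruit mango food mango fruit".split()

print(top_words(tokens, 2))
print(hypernym_chain("mango", HYPERNYM))
```

In a real WordNet, a synset groups several synonymous words and a noun may have more than one hypernym, so a single parent per word is a deliberate simplification here.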

Published

04/08/2024

How to Cite

Nazeer, M., Azher, M., Pervaiz, A., & Yasmeen, I. (2024). Developing Lexico-Semantic Relations of Saraiki Nouns: A Corpus-Based Study. University of Chitral Journal of Linguistics and Literature, 8(I), 162-182. https://doi.org/10.33195/
