
Assamese Stopwords

 
CLARIN-PL
  Authors
Sarma, Prof. Shikhar Kr.
 Date issued
2019-01-08
 Type
lexicalConceptualResource
 Size
264 words
 Language(s)
Assamese
 Description
Stopwords are the most frequently occurring words in a context. They do not play an important role in retrieving information and contribute no important information about the context, so they should be removed before processing. These words have very low discrimination value and are sometimes referred to as noise words. This Assamese stopword list contains 264 words. Examples are: যেতিয়া, যেন, যেনিবা, যেনে, যোগে, লগ, লৈ, etc.
---
1. These Assamese NLP resources, including the tools and applications, were developed during research and development projects as well as Master's and Ph.D. thesis work.
2. They were mainly developed or generated at the Department of Computer Science and the Department of Information Technology, Gauhati University.
3. These resources are used by students and researchers for further study and research, as well as for the design and development of tools and applications.
4. Computational linguistics for Assamese is not yet rich in resources. Natural language processing work has mainly started during the last two decades, and most of the resources are first-generation resources with ample scope for upgrading, enriching, and refining.
5. These are essential resources for researchers in Assamese NLP, as the language requires more NLP work to make Assamese a rich medium for the digital world.
6. Anyone interested in or in need of such resources may express their interest in the required resources, and will be advised on how they can be made available.
7. These are purely research materials and may be used only for further research.
8. Researchers may visit the NLP Lab of the Department of Information Technology, Gauhati University, Guwahati, India, or contact us.
9. Researchers interested in collaborative work, and students looking for project work, are welcome.
10. The contact person is Professor Shikhar Kr. Sarma, Department of Information Technology, Gauhati University, Guwahati 781014, Assam, India. Email: sks@gauhati.ac.in
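
The removal step mentioned in the description can be sketched in Python as follows. This is only an illustration under stated assumptions: the set holds just the seven example words quoted in this record (the full 264-word list is not reproduced here), tokens are split on whitespace, and the names `SAMPLE_STOPWORDS` and `remove_stopwords` are hypothetical rather than part of the resource.

```python
import unicodedata

def _nfc(s):
    # Normalize so that differently encoded forms of the same letter compare equal.
    return unicodedata.normalize("NFC", s)

# Assumption: only the seven examples quoted in the record, not the full 264-word list.
SAMPLE_STOPWORDS = frozenset(
    _nfc(w) for w in ["যেতিয়া", "যেন", "যেনিবা", "যেনে", "যোগে", "লগ", "লৈ"]
)

def remove_stopwords(text, stopwords=SAMPLE_STOPWORDS):
    """Split text on whitespace and drop every token found in the stopword set."""
    return [tok for tok in _nfc(text).split() if tok not in stopwords]
```

A real pipeline would load the complete list and would likely need an Assamese-aware tokenizer that separates punctuation such as the danda (।) from words; plain whitespace splitting is used here only to keep the sketch short.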
 Publisher
Department of Information Technology, Gauhati University, Assam, India
 Acknowledgement

Department of Electronics and IT, Govt. of India

Project code: CLIA Consortia

Project name: CLIA-Assamese

 Subject(s)
Assamese NLP; Assamese stopwords; Assamese noise words; Gauhati University
 Collection(s)
Assamese NLP Resources