Paper Title :Language Normalisation Of Noisy Text Data
Author :Akash Mohanty, Santosh Kumar Majhi, Sampa Chaupattnaik
Article Citation :Akash Mohanty ,Santosh Kumar Majhi ,Sampa Chaupattnaik ,
(2013 ) " Language Normalisation Of Noisy Text Data " ,
International Journal of Advance Computational Engineering and Networking (IJACEN) ,
pp. 36-39,
Volume-1,Issue-3
Abstract : This paper addresses the issue of language normalization, an important problem in natural language processing.
Facebook, Twitter provides access to large volumes of data in real time, but is notoriously noisy, hampering its utility for
NLP. By language normalization, we mean converting ‘informally inputted’ text into the structured form, by removing
‘noises’ in the text. It includes detection of ill-formed words, detecting paragraph and sentence boundaries in the text.
Previously, text normalization issues were often undertaken in an ad-hoc fashion or studied separately. This paper first gives
a formalization of the entire problem. It then proposes a Knowledge based approach to perform to make the text data errorfree.
Type : Research paper
Published : Volume-1,Issue-3
DOIONLINE NO - IJACEN-IRAJ-DOIONLINE-35
View Here
Copyright: © Institute of Research and Journals
|
|
| |
|
PDF |
| |
Viewed - 29 |
| |
Published on 2014-01-18 |
|