Applied Text Analytics for Comments on News-Articles

Anne Schuth. 2007.


Several on-line daily newspapers offer readers the opportunity to directly comment on articles. In the Netherlands this feature is used quite often and the quality (grammatically and content-wise) is surprisingly high. The paper develops techniques to collect, store, enrich and analyze these comments. After giving a high-level overview of the Dutch ‘commentosphere’ we zoom in on extracting the discussion structure found in flat comment threads; people not only comment on the news article, they also heavily comment on other comments, resembling discussion fora. We show how techniques from information retrieval, natural language processing and machine learning can be used to extract the ‘reacts-on’ relation between comments with remarkably high precision and recall.


Applied Text Analytics for Comments on News-Articles


  title = {Applied Text Analytics for Comments on News-Articles},
  author = {Anne Schuth},
  year = {2007}