Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Web Digging Strategies for Extraction of News


Affiliations
1 Department of Computer Applications, Alagappa University, Karaikudi, Tamil Nadu, India
     

   Subscribe/Renew Journal


The fast extension of the web is creating the consistent development of data, prompting to a few issues, for example, an expanded trouble of extricating conceivably helpful information. Web content mining faces this issue gathering express data from various sites for its get to and learning revelation. Its present techniques concentrate on dissecting static sites and can't manage always showing signs of change sites, for example, news locales. In this paper, a new strategy is proposed for mining on the web news destinations. This strategy applies dynamic plans for investigating these sites and removing news reports. It uses space autonomous measurable examination for pattern investigation. The general technique is the use of web mining technique that goes past direct news examination, attempting to comprehend current society interests and to gauge the social significance of progressing occasions.

Keywords

Web, News Extraction, Really Simple Syndication (RSS).
Subscription Login to verify subscription
User
Notifications
Font Size


  • L. Yi, B. Liu, and X. Li, “Eliminating noisy information in web pages for data mining,” In Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2003.
  • Z. Bar-Yossef, and S. Rajagopalan, “Template detection via data mining and its applications,” In Proceedings of the Eleventh International Conference on World Wide Web, 2002.
  • D. Cai, S. Yu, J. Wen, and W. Ma, “Extracting content structure for web pages based on visual representation,” In Web Technologies and Applications: 5th Asia-Pacific Web Conference (APWeb 2003), 2003.
  • Z. Ji, W. Hsu, and M. L. Lee, “Image mining: Issues, frameworks and techniques,” In Proc. of the 2nd International Workshop on Multimedia Data Mining (MDM/KDD’2001), San Francisco, CA, USA, pp. 13-20, 2001.
  • H. Shinnou, and M. Sasaki, “Automatic extraction of target parts from a web page,” In IPSJ SIG Notes, vol. 2004-NL-162, pp. 33-40, 2004. In Japanese.
  • S. Zheng, R. Song, and J.-R. Wen, “Template independent news extraction based on visual consistency,” In Proceedings of the 22th AAAI Conference on Artificial Intelligence, pp. 1507-1513, 2007.
  • Y. Dong, Q. Li, Z. Yan, and Y. Ding, “A generic web news extraction approach,” In Proceedings of the 2008 IEEE International Conference on Information and Automation, pp. 179-183, 2008.
  • S. Agarwal, A. Singhal, and P. Bedi, “Classification of RSS news items using ontology,” 12th International Conference on Intelligent Systems Design and Applications ISDA, pp. 491-496, 2012.

Abstract Views: 163

PDF Views: 3




  • Web Digging Strategies for Extraction of News

Abstract Views: 163  |  PDF Views: 3

Authors

K. R. Vanishree
Department of Computer Applications, Alagappa University, Karaikudi, Tamil Nadu, India
T. Meyyappan
Department of Computer Applications, Alagappa University, Karaikudi, Tamil Nadu, India

Abstract


The fast extension of the web is creating the consistent development of data, prompting to a few issues, for example, an expanded trouble of extricating conceivably helpful information. Web content mining faces this issue gathering express data from various sites for its get to and learning revelation. Its present techniques concentrate on dissecting static sites and can't manage always showing signs of change sites, for example, news locales. In this paper, a new strategy is proposed for mining on the web news destinations. This strategy applies dynamic plans for investigating these sites and removing news reports. It uses space autonomous measurable examination for pattern investigation. The general technique is the use of web mining technique that goes past direct news examination, attempting to comprehend current society interests and to gauge the social significance of progressing occasions.

Keywords


Web, News Extraction, Really Simple Syndication (RSS).

References