Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Research on Ranking Algorithms in Web Structure Mining


Affiliations
1 Mathematics and Computer Applications Department, Maulana Azad National Institute of Technology, Bhopal, Madhya Pradesh, India
     

   Subscribe/Renew Journal


In the past few decades, the Web has emerged as a treasure of information and web mining is a technique to handle this treasure. During recent years web mining has been a well-researched area. Web mining is the application of the data mining which is useful to extract the knowledge from web. With the progress of web, more and more data are now available for users on web. Web structure mining deals with the contents and hyperlinks on web pages.In this review paper, we have focused on three basic algorithms for evaluating the importance of pages i.e. Page Rank, Weighted Page Rrank, and Hyperlink-Induced Topic Search and comparison of those algorithms. PageRank algorithm is based on back links of the page and it calculates the rank of web pages at indexing time. Weighted Page Rank algorithm scores pages according to their relevancies and rank of a page is calculated by its number of incoming and outgoing links. Hyperlink-induced topic search algorithm is an iterative algorithm developed to quantify each page's value as an authority and as a hub. This study was done basically to explore the link structure algorithms for ranking pages.

Keywords

Web Mining, Web Structure Mining, Page Rank, Weighted Page Rank, Hyperlink-Induced Topic Search.
Subscription Login to verify subscription
User
Notifications
Font Size


  • da Costa Júnior, M. G. & Gong, T. Z. (2005). Web structure mining: An introduction. Proceedings of the 2005 IEEE International Conference on Information Acquisition.
  • Duhan, N., Sharma, A. K., & Bhatia, K. K. (2009). Page ranking algorithms: A survey. 2009 IEEE International Advance Computing Conference (IACC 2009).
  • Eirinaki, M. (2004). Web Mining: A Roadmap. Department of Informatics Athens University of Economics and Business.
  • Getoor, L., & Diehl, C. P. (2005). Link mining: A survey. ACM SIGKDD Explorations Newsletter, December, 7(2), 64-71.
  • Kleinberg, J. K. (1999). Authoritative sources in a hyperlinked environment. Journal of the ACM, September, 46(5), 604-632.
  • Kosala, R., & Blockeel, H. (2000). Web Mining Research: A Survey. ACM SIGKDD Explorations Newsletter, July, 2(1), 1-15.
  • Liu, L., Chen, J., & Song, H. (2001). The research of web mining. Proceedings of the 4th World Congress on Intelligent Control and Automation, June, (pp. 10-14).
  • Page, L., Brin, S., Motwani, R., & Winograd, T. (2002). The Pagerank Citation Ranking: Bringing order to the Web. Technical Report, Stanford Digital Libraries.
  • Ridings, C., & Shishigin, M. (2002). Pagerank Uncovered. Technical Report, 2002.
  • Sangeetha, M., & Joseph, K. S. (2015). Page ranking algorithms used in web mining .ICICES 2014 - S. A. Engineering College, Chennai, Tamil Nadu, India.
  • Shivakumar, B. L. & Mylsami, T. (2014). Survey on web structure mining. ARPN Journal of Engineering and Applied Sciences, October, 9(10), 1914-1923.
  • Xing, W., & Ghorbani, A. (2004). Weighted pagerank algorithm. Technical report, 2 Proceedings of the Second Annual Conference on Communication Networks and Services Research (CNSR’04) 2004 IEEE.
  • Yan, L., Wei, Y., Gui, Z., & Chen, Y. (2011). Research on pagerank and hyperlink-induced topic search in web structure mining. International Conference on Internet Technology and Applications.
  • Directed Graph. Retrieved from https://en.wikipedia.org.

Abstract Views: 211

PDF Views: 2




  • Research on Ranking Algorithms in Web Structure Mining

Abstract Views: 211  |  PDF Views: 2

Authors

Suvarna Sharma
Mathematics and Computer Applications Department, Maulana Azad National Institute of Technology, Bhopal, Madhya Pradesh, India
Amit Bhagat
Mathematics and Computer Applications Department, Maulana Azad National Institute of Technology, Bhopal, Madhya Pradesh, India

Abstract


In the past few decades, the Web has emerged as a treasure of information and web mining is a technique to handle this treasure. During recent years web mining has been a well-researched area. Web mining is the application of the data mining which is useful to extract the knowledge from web. With the progress of web, more and more data are now available for users on web. Web structure mining deals with the contents and hyperlinks on web pages.In this review paper, we have focused on three basic algorithms for evaluating the importance of pages i.e. Page Rank, Weighted Page Rrank, and Hyperlink-Induced Topic Search and comparison of those algorithms. PageRank algorithm is based on back links of the page and it calculates the rank of web pages at indexing time. Weighted Page Rank algorithm scores pages according to their relevancies and rank of a page is calculated by its number of incoming and outgoing links. Hyperlink-induced topic search algorithm is an iterative algorithm developed to quantify each page's value as an authority and as a hub. This study was done basically to explore the link structure algorithms for ranking pages.

Keywords


Web Mining, Web Structure Mining, Page Rank, Weighted Page Rank, Hyperlink-Induced Topic Search.

References