Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Measurement of Distance from Page Sequences Using Dynamic Programming


Affiliations
1 Department of Computer Applications, Maulana Azad National Institute of Technology, Bhopal, India
2 Department of Computer Applications, Maulana Azad National Institute of Technology, Bhopal, M.P., India
     

   Subscribe/Renew Journal


Internet is playing a vital role for accessing information, because lots of information is available on internet. Lots of data are rapidly growing, but the data which is resided on the web include irrelevant information, it contains different types of data format. Due to heterogeneity of data it is very challenging task to retrieve relevant information from web data. Using web usage mining technique, mine the relevant information from large amount of data available in the web logs format that enclose intrinsic information regarding web pages accessed. Because of this large amount of web log data, it is better to deal with small set of data at a time, instead of handling with whole data jointly. Now we need to find the distance between two user sessions, using some distance similarity function can be accomplish this kind of tasks. Clustering of users tends to establish groups of users exhibiting similar browsing patterns. In this paper we propose novel algorithm, for measuring the similarity between two user sessions based on sequence alignment that uses the Longest Common Subsequence method.

Keywords

Clustering, Longest Common Subsequence, Web Logs, Web Usage Mining.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 195

PDF Views: 2




  • Measurement of Distance from Page Sequences Using Dynamic Programming

Abstract Views: 195  |  PDF Views: 2

Authors

Brijesh Bakariya
Department of Computer Applications, Maulana Azad National Institute of Technology, Bhopal, India
Ghanshyam Singh Thakur
Department of Computer Applications, Maulana Azad National Institute of Technology, Bhopal, M.P., India

Abstract


Internet is playing a vital role for accessing information, because lots of information is available on internet. Lots of data are rapidly growing, but the data which is resided on the web include irrelevant information, it contains different types of data format. Due to heterogeneity of data it is very challenging task to retrieve relevant information from web data. Using web usage mining technique, mine the relevant information from large amount of data available in the web logs format that enclose intrinsic information regarding web pages accessed. Because of this large amount of web log data, it is better to deal with small set of data at a time, instead of handling with whole data jointly. Now we need to find the distance between two user sessions, using some distance similarity function can be accomplish this kind of tasks. Clustering of users tends to establish groups of users exhibiting similar browsing patterns. In this paper we propose novel algorithm, for measuring the similarity between two user sessions based on sequence alignment that uses the Longest Common Subsequence method.

Keywords


Clustering, Longest Common Subsequence, Web Logs, Web Usage Mining.