Refine your search
Collections
Co-Authors
Journals
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z All
Salim, Naomie
- Hybridization of Bag-of-Words and Forum Metadata for Web Forum Question Post Detection
Abstract Views :128 |
PDF Views:0
Authors
Affiliations
1 Department of Computer Science, College of Science and Technology, Kaduna Polytechnic, P.M.B 2021, Kaduna, NG
2 Faculty of Computing, Universiti Teknologi Malaysia, 81310, Skudai, Johor, MY
1 Department of Computer Science, College of Science and Technology, Kaduna Polytechnic, P.M.B 2021, Kaduna, NG
2 Faculty of Computing, Universiti Teknologi Malaysia, 81310, Skudai, Johor, MY
Source
Indian Journal of Science and Technology, Vol 8, No 32 (2015), Pagination:Abstract
Background/Objective:A web forum is a problem-solving online community.Web forum research activitieshave been focused on answer mining with the assumption that the starting post is a question post. This paper proposes methods for mining standard web forum questions. Methods/Statistical Analysis:Popular methods for web forum question post detection are question mark, question words, higher n-grams and sequential pattern mining. These methods have problem of low detection rate and implementation complexity. Implemented in this paper is hybridization of simple bag-of-words model with web forum metadata, simple rule of question mark and question words. Dimensional reduction was performed using chi-square and wrapper techniques. Findings:The quality of web forum question posts varies from excellent to mediocre or even spam. Detecting good question posts is non-trivial. It requires utilization of salient features. Combination of simple rule of question mark and question words with forum metadata performed better than each of the two.Integration of bag-of-words model with simple rule of question marks, question words and forum metadata enhances question post detection. Dimensionality reduction using chi-square were found to perform better than other popular filters like info gain, gain ratio and symmetric uncertain. Applications/Improvements: Three publicly available datasets of varying technical degrees were used for the experiments.The experimental results revealed that an enhanced bag-of-words model can perform better than complex techniques that implement higher N-gram with part-of-speech tagging.Keywords
Bag-of-words, Forum Metadata, Web Forum,Question Detection, Dimensionality Reduction, Web Forum Question- An Optimized Semantic Technique for Multi- Document Abstractive Summarization
Abstract Views :127 |
PDF Views:0
Authors
Affiliations
1 Faculty of Computing, Universiti Teknologi Malaysia, 81310, Skudai, Johor, MY
1 Faculty of Computing, Universiti Teknologi Malaysia, 81310, Skudai, Johor, MY
Source
Indian Journal of Science and Technology, Vol 8, No 32 (2015), Pagination:Abstract
Background/Objective: Multi-document summarization produces a concise summary from several online topically related documents. A major challenge in this domain is usually the information overlap in documents emanating from various sources. This paper introduces an optimized semantic technique for multi-document abstractive summarization. Methods/Statistical Analysis: Linguistic and semantic approaches are usually employed for abstractive summarization of multiple documents. Linguistic approaches lack semantic representation of source text while semantic approaches mostly rely on human experts to construct domain ontology and rules; which require immense time and effort. The technique in this paper utilizes the benefits of semantic role labeling, clustering and Particle Swarm Optimization (PSO) to rank predicate argument structures (semantic representation) in each cluster using optimized features. Findings: The summary quality is susceptible to the text features i.e., different features have varied importance towards summary generation. Therefore, optimal features weights obtained using PSO integrated in the semantic technique to rank semantic representation improved summarization results. The performance of the technique is evaluated against the benchmark summarization systems using pyramid evaluation measures (mean coverage score, precision and F-measure). A Paired- Samples T-test is carried out to validate the summarization results. Applications/Improvements: Experiment of this research is performed with DUC-2002, a benchmark data set for text summarization. Experimental results confirm that the proposed technique yields better results than other comparison summarization models in terms of mean coverage score and average F-measure.Keywords
Language Generation, Multi-Document Abstractive Summarization, Particle Swarm Optimization, Semantic Similarity Measure, Semantic Role Labeling (SRL).- Shape-Based Plagiarism Detection for Flowchart Figures in Texts
Abstract Views :193 |
PDF Views:109
Authors
Affiliations
1 Universiti Teknologi Malaysia, Skudai, MY
1 Universiti Teknologi Malaysia, Skudai, MY
Source
AIRCC's International Journal of Computer Science and Information Technology, Vol 6, No 1 (2014), Pagination: 113-124Abstract
Plagiarism detection is well known phenomenon in the academic arena. Copying other people is considered as serious offence that needs to be checked. There are many plagiarism detection systems such as turn-it-in that has been developed to provide this checks. Most, if not all, discard the figures and charts before checking for plagiarism. Discarding the figures and charts results in look holes that people can take advantage. That means people can plagiarized figures and charts easily without the current plagiarism systems detecting it. There are very few papers which talks about flowcharts plagiarism detection. Therefore, there is a need to develop a system that will detect plagiarism in figures and charts. This paper presents a method for detecting flow chart figure plagiarism based on shape-based image processing and multimedia retrieval. The method managed to retrieve flowcharts with ranked similarity according to different matching sets.Keywords
Flowchart, Multimedia Retrieval, Figures Similarity, Image Comparison, Figure Retrieval.- Improving Triangle-Graph Based Text Summarization using Hybrid Similarity Function
Abstract Views :174 |
PDF Views:0
Authors
Affiliations
1 Faculty of Computing, Universiti Teknologi Malaysia, 81310, Skudai, Johor, MY
1 Faculty of Computing, Universiti Teknologi Malaysia, 81310, Skudai, Johor, MY