Attribute based content mining for regional Web documents

Raman, A.R. and Dorai Rangaswamy, M.A. and Prakash, K.B. (2013) Attribute based content mining for regional Web documents. In: IET Chennai Fourth International Conference on Sustainable Energy and Intelligent Systems (SEISCON 2013), 12-14 Dec. 2013, Chennai, India.

Full text not available from this repository.

Abstract

The rapid growth of the Internet has made extracting information from Web pages critical. Pages contain main content as well as noisy blocks like navigation, copyright, ads, and hyperlinks, which hinder clustering, classification, and retrieval. Challenges increase with regional languages. This paper presents a novel pixel-map-based approach for content extraction and knowledge creation for illiterate users, combining pixel attributes, pattern matching, statistical models, and Artificial Neural Networks. The goal is concept-based term analysis at sentence and document levels rather than single-term analysis. © 2015 Elsevier B.V., All rights reserved.

Item Type: Conference or Workshop Item (Paper)
Subjects: Computer Science > Information Systems
Divisions: Engineering and Technology > Aarupadai Veedu Institute of Technology, Chennai
Depositing User: Unnamed user with email techsupport@mosys.org
Last Modified: 10 Dec 2025 07:00
URI: https://vmuir.mosys.org/id/eprint/4303

Actions (login required)

View Item
View Item