Raman, A.R. and Dorai Rangaswamy, M.A. and Prakash, K.B. (2013) Attribute based content mining for regional Web documents. In: IET Chennai Fourth International Conference on Sustainable Energy and Intelligent Systems (SEISCON 2013), 12-14 Dec. 2013, Chennai, India.
Full text not available from this repository.Abstract
The rapid growth of the Internet has made extracting information from Web pages critical. Pages contain main content as well as noisy blocks like navigation, copyright, ads, and hyperlinks, which hinder clustering, classification, and retrieval. Challenges increase with regional languages. This paper presents a novel pixel-map-based approach for content extraction and knowledge creation for illiterate users, combining pixel attributes, pattern matching, statistical models, and Artificial Neural Networks. The goal is concept-based term analysis at sentence and document levels rather than single-term analysis. © 2015 Elsevier B.V., All rights reserved.
| Item Type: | Conference or Workshop Item (Paper) |
|---|---|
| Subjects: | Computer Science > Information Systems |
| Divisions: | Engineering and Technology > Aarupadai Veedu Institute of Technology, Chennai |
| Depositing User: | Unnamed user with email techsupport@mosys.org |
| Last Modified: | 10 Dec 2025 07:00 |
| URI: | https://vmuir.mosys.org/id/eprint/4303 |
Dimensions
Dimensions