A Novel Lexicalized HMM-based Learning Framework for Web Opinion Mining
This paper introduces a machine learning framework based on Lexicalized Hidden Markov Models (HMMs) for extracting and analyzing opinions from web product reviews. The framework aims to identify product-related entities (such as features and components) and classify their associated opinions (positive or negative). By integrating part-of-speech tags and contextual information into the HMMs, the model improves its accuracy in identifying both frequent and infrequent opinion phrases. The method also incorporates a bootstrapping process to self-learn new vocabularies, reducing the need for extensive manual labeling. Experimental results show that this framework outperforms rule-based methods in opinion sentence extraction and opinion polarity classification, providing a more robust solution for opinion mining in e-commerce environments.