Analyzing imbalanced online consumer review data in product design using geometric semantic genetic programming
Created by W.Langdon from
gp-bibliography.bib Revision:1.8081
- @Article{CHAN:2021:EAAI,
-
author = "Kit Yan Chan and C. K. Kwong and Huimin Jiang",
-
title = "Analyzing imbalanced online consumer review data in
product design using geometric semantic genetic
programming",
-
journal = "Engineering Applications of Artificial Intelligence",
-
volume = "105",
-
pages = "104442",
-
year = "2021",
-
ISSN = "0952-1976",
-
DOI = "doi:10.1016/j.engappai.2021.104442",
-
URL = "https://www.sciencedirect.com/science/article/pii/S0952197621002906",
-
keywords = "genetic algorithms, genetic programming, New product
development, Social media, Online customer reviews,
Imbalanced data mining, Multi-objective optimization",
-
abstract = "To develop a successful product, understanding the
relationship between customer satisfaction (CS) and
design attributes of a new product is essential.
Nowadays IoT technologies are used to collect online
review data from social media. More representative CS
models are developed using online review data. However,
online review data is imbalanced, since popular
products receive more online consumer reviews and
unpopular products receive less. When imbalanced data
is used, CS models learn the characteristics of
majority data while rarely learning minority data.
Misleading analysis for product development is made
since the CS model is biased to popular products. This
paper proposes an approach to generate nondominated CS
models which learn equally to imbalanced data from
popular and unpopular products. A multi-objective
optimization problem is formulated to learn equally in
imbalanced data. This problem is proposed to be solved
by the geometric semantic genetic programming (GSGP); a
Pareto set of nondominated CS models is generated by
the GSGP. Product designers select the most preferred
models in the Pareto set. The preferred nondominated CS
model attempts to tradeoff unpopular and popular
products, to determine optimal design attributes and
maximize the CS. The case study shows that the proposed
GSGP is able to generate CS models with more accurate
CS predictions compared to the commonly used methods.
The proposed GSGP also generates a Pareto set of
nondominated CS models which equally learn consumer
reviews for those dryers. Based on the Pareto set, the
design team selects the most preferred CS model",
- }
Genetic Programming entries for
Kit Yan Chan
Che Kit Kwong
Huimin Jiang
Citations