Skip to main content

Predicting Product Choice with Symbolic Regression and Classification

  • Chapter
  • First Online:
Genetic Programming Theory and Practice XIII

Part of the book series: Genetic and Evolutionary Computation ((GEVO))

  • 895 Accesses

Abstract

Market researchers often conduct surveys to measure how much value consumers place on the various features of a product. The resulting data should enable managers to combine these utility values in different ways to predict the market share of a product with a new configuration of features. Researchers assess the accuracy of these choice models by measuring the extent to which the summed utilities can predict actual market shares when respondents choose from sets of complete products. The current paper includes data from 201 consumers who gave ratings to 18 cell phone features and then ranked eight complete cell phones. A simple summing of the utility values predicted the correct product on the ranking task for 22.8 % of respondents. Another accuracy measurement is to compare the market shares for each product using the ranking task and the estimated market shares based on summed utilities. This produced a mean absolute difference between ranked and estimated market shares of 7.8 %. The current paper applied two broad strategies to improve these prediction methods. Various evolutionary search methods were used to classify the data for each respondent to predict one of eight discrete choices. The fitness measure of the classification approach seeks to reduce the Classification Error Percent (CEP) which minimizes the percent of incorrect classifications. This produced a significantly better fit with the hit rate rising from 22.8 to 35.8 %. The mean absolute deviation between actual and estimated market shares declined from 7.8 to 6.1 % (p. <0.01). A simple language specification will be illustrated to define symbolic regression and classification searches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Breiman L, Friedman J, RAOlshen, Stone C (1984) Classification and regression trees. Wadsworth and Brooks, Pacific Grove

    Google Scholar 

  • Codd EF (1983) A relational model of data for large shared data banks. Commun ACM 26(1): 64–69

    Article  Google Scholar 

  • Green PE, Rao V (1971) Conjoint measurement for quantifying judgmental data. J Mark Res 8(3):355–363

    Article  Google Scholar 

  • Korns MF (2007) Large-scale, time-constrained symbolic regression-classification. In: Riolo RL, Soule T, Worzel B (eds) Genetic programming theory and practice V. Genetic and evolutionary computation, chap 4. Springer, Ann Arbor, pp 53–68. doi:10.1007/978-0-387-76308-8_4

  • Korns MF (2010) Abstract expression grammar symbolic regression. In: Riolo R, McConaghy T, Vladislavleva E (eds) Genetic programming theory and practice VIII. Genetic and Evolutionary Computation, vol 8, chap 7. Springer, Ann Arbor, pp 109–128. http://www.springer.com/computer/ai/book/978-1-4419-7746-5

    Google Scholar 

  • Korns MF (2011) Accuracy in symbolic regression. In: Riolo R, Vladislavleva E, Moore JH (eds) Genetic programming theory and practice IX. Genetic and evolutionary computation, chap 8. Springer, Ann Arbor, pp 129–151. doi:10.1007/978-1-4614-1770-5_8

    Google Scholar 

  • Marder E (1997) The laws of choice: predicting customer behavior. The Free Press, New York

    Google Scholar 

  • McCulloch W, Pitts W (1943) A logical calculus of the ideas immanent in nervous activity. Bull Math Biophys 5(4):115–133

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Philip Truscott .

Editor information

Editors and Affiliations

Appendices

Appendix 1: Questionnaire Text

  1. 1.

    Operating system

    1. a.

      Android

    2. b.

      Symbian

    3. c.

      Windows

    4. d.

      Blackberry

    5. e.

      iOS (iPhone OS)

  2. 2.

    Screen size

    1. a.

      Less than 3 in.

    2. b.

      3.0–3.4 in.

    3. c.

      3.5–3.9 in.

    4. d.

      4.0–4.4 in.

    5. e.

      4.5–4.9 in.

    6. f.

      5 in. and over

  3. 3.

    Camera memory

    1. a.

      Below 2 megapixels

    2. b.

      2–4.9 megapixels

    3. c.

      5–7.9 megapixels

    4. d.

      8 Megapixels and above

  4. 4.

    Memory

    1. a.

      Below 8 GB

    2. b.

      8–15.9 GB

    3. c.

      16–31.9 GB

    4. d.

      32–63.9 GB

    5. e.

      64 GB or more

  5. 5.

    Talk time

    1. a.

      Less than 6 h

    2. b.

      6–11 h

    3. c.

      12–23 h

    4. d.

      24–35 h

    5. e.

      36 h or more

  6. 6.

    Stand by time

    1. a.

      Under 50 h

    2. b.

      50–99 h

    3. c.

      100–199 h

    4. d.

      200–299 h

    5. e.

      300 h or more

  7. 7.

    Price

    1. a.

      5000 Rs or less

    2. b.

      5001–10,000 Rs

    3. c.

      10,001–18,000 Rs

    4. d.

      18,001–35,000 Rs

    5. e.

      35,001 Rs and above

  8. 8.

    Phone thickness

    1. a.

      Less than 6 mm

    2. b.

      6–7 mm

    3. c.

      8–9 mm

    4. d.

      10–11 mm

    5. e.

      12 mm or more

  9. 9.

    CPU speed

    1. a.

      1 GHz or less

    2. b.

      1.0–1.3 GHz

    3. c.

      1.4–1.5 GHz

    4. d.

      1.6–1.9 GHz

    5. e.

      2.0 GHz or more

  10. 10.

    Warranty length

    1. a.

      Free repairs for 6 months

    2. b.

      Free repairs for 1 year

    3. c.

      Free repairs for 1.5 years

    4. d.

      Free repairs for 2 years

    5. e.

      Free repairs for 2.5 years

  11. 11.

    GPS

    1. a.

      Has GPS

    2. b.

      No GPS

  12. 12.

    Wi-Fi

    1. a.

      Has Wi-Fi

    2. b.

      No Wi-Fi

  13. 13.

    Touchscreen

    1. a.

      Has a touchscreen

    2. b.

      No touchscreen

  14. 14.

    SIM format

    1. a.

      Single SIM

    2. b.

      Dual SIM

  15. 15.

    3G

    1. a.

      Has 3G connectivity

    2. b.

      No 3G connectivity

  16. 16.

    Qwerty keyboard

    1. a.

      Has a QWERTY keyboard

    2. b.

      No QWERTY keyboard

  17. 17.

    Brand impression

    1. a.

      Apple

    2. b.

      Samsung

    3. c.

      Blackberry

    4. d.

      XOLO

    5. e.

      Spice

    6. f.

      Micromax

    7. g.

      Nokia

    8. h.

      Lava

Appendix 2: Sources of Feature Data

All feature data for the eight mobile phones were drawn from www.Flipkart.com on September 26th, 2013 except the following items that were missing from the Flipkart comparison screens.

For the iPhone 5 with 32 GB, data was missing for the CPU speed attribute. This was taken from www.GSMArena.com on September 26th 2013.

For the Samsung Galaxy Note 2, data was missing for the talk-time and standby time attributes. This was taken from www.GSMArena.com on September 26th 2013.

For the Blackberry Curve 9220, data was missing for the GPS attribute. This was taken from www.GSMArena.com on September 26th 2013. The CPU speed attribute was missing from both these sources. It was taken from asia.cnet.com on September 26th 2013.

For the XOLO Q1000, data was missing for the GPS attribute. This was taken from www.GSMArena.com on September 26th 2013.

For the Spice MI-495, data was missing for the USB connection attribute. This was taken from www.GSMArena.com on September 26th 2013. The phone thickness attribute was missing from both these sources. It was taken from comapareindia.in.com on November 5th 2013.

For the Micromax Canvas 4 A210, data was missing for the GPS attribute. This was taken from www.GSMArena.com on September 26th 2013.

For the Lava Iris 504Q, data was missing for the GPS attribute. It was taken from comapareindia.in.com on November 5th 2013.

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Truscott, P., Korns, M.F. (2016). Predicting Product Choice with Symbolic Regression and Classification. In: Riolo, R., Worzel, W., Kotanchek, M., Kordon, A. (eds) Genetic Programming Theory and Practice XIII. Genetic and Evolutionary Computation. Springer, Cham. https://doi.org/10.1007/978-3-319-34223-8_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-34223-8_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-34221-4

  • Online ISBN: 978-3-319-34223-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics