Skip to main content

Technology Analysis from Patent Data Using Latent Dirichlet Allocation

  • Conference paper
Soft Computing in Big Data Processing

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 271))

Abstract

This paper discusses how to apply latent Dirichlet allocation, a topic model, in a trend analysis methodology that exploits patent information. To accomplish this, text mining is used to convert unstructured patent documents into structured data. Next, the term frequency-inverse document frequency (tf-idf) value is used in the feature selection process. After the text preprocessing, the number of topics is decided using the perplexity value. In this study, we employed U.S. patent data on technology that reduces greenhouse gases. We extracted words from 50 relevant topics and showed that these topics are highly meaningful in explaining trends per period.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Lee, S.J., Yoon, B.Y., Park, Y.T.: An approach to discovering new technology opportunities: Keyword-based patent map approach. Technovation 29, 483–484 (2009)

    Article  Google Scholar 

  2. Tseng, Y.H., Lin, C.J., Lin, Y.I.: Text mining techniques for patent analysis. Information Processing & Management 43, 1216–1247 (2007)

    Article  Google Scholar 

  3. Jun, S.H., Park, S.S., Jang, D.S.: Technology forecasting using matrix map and patent clustering. Industrial Management & Data Systems 112(5), 786–807 (2012)

    Article  Google Scholar 

  4. Yoon, B.U., Yoon, C.B., Park, Y.T.: On the development and application of a self-organizing feature map-based patent map. R&D Management 32(4), 291–300 (2002)

    Article  Google Scholar 

  5. Noh, T.G., Park, S.B., Lee, S.J.: A Semantic Representation Based-on Term Co-occurrence Network and Graph Kernel. International Journal of Fuzzy Logic and Intelligent Systems 11(4) (2011)

    Google Scholar 

  6. Blei, D.M., Lafferty, J.D.: Dynamic Topic Models. In: 23rd International Conference on Machine Learning, Pittsburgh, PA (2006)

    Google Scholar 

  7. Griffiths, T.L., Steyvers, M.: Finding scientific topics. Proceedings of the National Academy of Sciences of the United States of America 101, 5228–5235 (2004)

    Article  Google Scholar 

  8. Uhm, D., Jun, S., Lee, S.J.

    Google Scholar 

  9. Cho, J.H., Lee, D.J., Park, J.I., Chun, M.G.: Hybrid Feature Selection Using Genetic Algorithm and Information Theory. International Journal of Fuzzy Logic and Intelligent Systems 13(1) (2013)

    Google Scholar 

  10. Blei, D.V., Ng, A.Y., Jordan, M.I.: Latent Dirichlet Allocation. Journal of Machine Learning Research 3, 993–1022 (2003)

    MATH  Google Scholar 

  11. Steyvers, M., Griffiths, T.: Probabilistic topic models

    Google Scholar 

  12. Grun, B., Hornik, K.: topicmodels: An R Package for Fitting Topic Models. Journal of Statistical Software 40(13) (2011)

    Google Scholar 

  13. Simpson, M.M.: Climate Change Technology Initiative (CCTI): Research, Technology, and Related Program. CRS Report for Congress (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Kim, G., Park, S., Jang, D. (2014). Technology Analysis from Patent Data Using Latent Dirichlet Allocation. In: Lee, K., Park, SJ., Lee, JH. (eds) Soft Computing in Big Data Processing. Advances in Intelligent Systems and Computing, vol 271. Springer, Cham. https://doi.org/10.1007/978-3-319-05527-5_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-05527-5_8

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-05526-8

  • Online ISBN: 978-3-319-05527-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics