Topic Modeling for Support Ticket using Latent Dirichlet Allocation

  • Wiranto Wiranto Universitas Sebelas Maret
  • Mila Rosyida Uswatunnisa Sebelas Maret University
Keywords: Topic Modeling, Support Ticket, Latent Dirichlet Allocation


In the business world, communication over customers must be built properly to make it easier for companies to find out what customers want. Support ticket is one of the business instrument for communication between the customers and the companies. Through a support ticket, customers can respond, complain or ask questions about products with a support team. Increasing the business process of the companies will be increasing the support ticket volume that should be handled by support team. It also has a value for analysis to get business intelligence decision. With that chance, an efficient data processing method is needed to find topics are being discussed by customers. One way that can be used to solve this problem is Topic Modeling. This research uses several parameters the number of topics, alpha value, beta value, iteration, and random seed. With this combination of parameters, the best results based on evaluation of human judgement and topic coherence with 5 topics, an alpha value of 50, a beta value of 0.01, 100 iterations, and 50 random seeds. The five topics interpretation consists of hosting migration, error problems in wordpress, domain email settings and domain transfer, ticketing and transaction processing. The total of 5 topics has a coherence value of 0.507897.


Download data is not yet available.


C. C. A. Blaz and K. Becker, “Sentiment Analysis in Tickets for IT Support,” in 2016 IEEE/ACM 13th Working Conference on Mining Software Repositories (MSR), May 2016, pp. 235–246.

C. Werner, G. Tapuc, L. Montgomery, D. Sharma, S. Dodos, and D. Damian, “How Angry are Your Customers? Sentiment Analysis of Support Tickets that Escalate,” in 2018 1st International Workshop on Affective Computing for Requirements Engineering (AffectRE), Aug. 2018, pp. 1–8,

doi: 10.1109/AffectRE.2018.00006.

A. M. Abirami and V. Gayathri, “A survey on sentiment analysis methods and approach,” in 2016 Eighth International Conference on Advanced Computing (ICoAC), Jan. 2017, pp. 72–76,

doi: 10.1109/ICoAC.2017.7951748.

B. K. Bhavitha, A. P. Rodrigues, and N. N. Chiplunkar, “Comparative study of machine learning techniques in sentimental analysis,” in 2017 International Conference on Inventive Communication and Computational Technologies (ICICCT), Mar. 2017, pp. 216–221, doi: 10.1109/ICICCT.2017.7975191.

J. A. Lossio-Ventura, J. Morzan, H. Alatrista-Salas, T. Hernandez-Boussard, and J. Bian, “Clustering and topic modeling over tweets: A comparison over a health dataset,” in 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Nov. 2019, pp. 1544–1547, doi: 10.1109/BIBM47256.2019.8983167.

D. M. Blei, A. Y. Ng, and M. I. Jordan, “Latent Dirichlet Allocation,” p. 30, 2003.

D. Chehal, P. Gupta, and P. Gulati, “Implementation and comparison of topic modeling techniques based on user reviews in e-commerce recommendations,” J Ambient Intell Human Computer, Apr. 2020, doi: 10.1007/s12652-020-01956-6.

A. Uteuov, “Topic model for online communities’ interests prediction,” Procedia Computer Science, vol. 156, pp. 204–213, 2019, doi: 10.1016/j.procs.2019.08.196.

M. Allahyari and K. Kochut, “Discovering Coherent Topics with Entity Topic Models,” in 2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI), Oct. 2016, pp. 26–33

doi: 10.1109/WI.2016.0015.

R. Feldman and J. Sanger, The text mining handbook: advanced approaches in analyzing unstructured data. Cambridge ; New York: Cambridge University Press, 2007.

D. P. Langgeni and Z. A. Baizal, “Clustering Artikel Berita Berbahasa Indonesia Menggunakan Unsupervised Feature Selection,” p. 10, 2010.

A. F. Hidayatullah and M. R. Ma’arif, “Road traffic topic modeling on Twitter using latent dirichlet allocation,” in 2017 International Conference on Sustainable Information Engineering and Technology (SIET), Malang, Nov. 2017, pp. 47–52.

doi: 10.1109/SIET.2017.8304107.

M. Hagras, G. Hassan, and N. Farag, “Towards Natural Disasters Detection from Twitter Using Topic Modelling,” in 2017 European Conference on Electrical Engineering and Computer Science (EECS), Nov. 2017, pp. 272–279, doi: 10.1109/EECS.2017.57.

T. L. Griffiths and M. Steyvers, “Finding scientific topics,” Proceedings of the National Academy of Sciences, vol. 101, no. Supplement 1, pp. 5228–5235, Apr. 2004

doi: 10.1073/pnas.0307752101.

J. H. Lau, D. Newman, and T. Baldwin, “Machine Reading Tea Leaves: Automatically Evaluating Topic Coherence and Topic Model Quality,” in Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, Gothenburg, Sweden, 2014, pp. 530–539.

doi: 10.3115/v1/E14-1056.

M. Omar, B.-W. On, I. Lee, and G. S. Choi, “LDA Topics: Representation and Evaluation,” Journal of Information Science, p. 14, 2015.

J. Chang, J. Boyd-Graber, S. Gerrish, C. Wang, and D. M. Blei, “Reading Tea Leaves: How Humans Interpret Topic Models,” 2009.

D. O’Callaghan, D. Greene, J. Carthy, and P. Cunningham, “An analysis of the coherence of descriptors in topic modeling,” Expert Systems with Applications, vol. 42, no. 13, pp. 5645–5657, Aug. 2015.

doi: 10.1016/j.eswa.2015.02.055.

D. Newman, J. H. Lau, K. Grieser, and T. Baldwin, “Automatic Evaluation of Topic Coherence,” in Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Los Angeles, California, Jun. 2010, pp. 100–108, Accessed: Oct. 03, 2020.

L. Montgomery and D. Damian, “What do Support Analysts Know About Their Customers? On the Study and Prediction of Support Ticket Escalations in Large Software Organizations,” in 2017 IEEE 25th International Requirements Engineering Conference (RE), Lisbon, Portugal, Sep. 2017, pp. 362–371

doi: 10.1109/RE.2017.61.

G. Aalipour, P. Kumar, S. Aditham, T. Nguyen, and A. Sood, “Applications of Sequence to Sequence Models for Technical Support Automation,” in 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, USA, Dec. 2018, pp. 4861–4869, doi: 10.1109/BigData.2018.8622395.

A. Angelpreethi and S. B. R. Kumar, “An Enhanced Architecture for Feature Based Opinion Mining from Product Reviews,” in 2017 World Congress on Computing and Communication Technologies (WCCCT), Tiruchirappalli, Tamil Nadu, India, Feb. 2017, pp. 89–92.

doi: 10.1109/WCCCT.2016.30.

Nazief, Bobby dan Mirna Adriani, "Confix-Stripping : Approach to Stemming Algorithm for Bahasa Indonesia", in 2004, Faculty of Computer Science University of Indonesia.

Wang, W., Feng, Y., Dai, W., 2018. Topic Analysis Of Online Reviews For Two Competitive Products Using Latent Dirichlet Allocation. Electron. Commer. Res. Appl. 29, 142–156. https://Doi.Org/10.1016/J.Elerap.2018.04.003

Valencia, J.D.M., Laure, A.J.T., Centino, N.M.R., Fabito, B.S., Imperial, J.M.R., Rodriguez, R.L., De La Cruz, A.H., Octaviano, M.V., Jamis, M.N., 2019. Understanding Anonymous Social Media Posts Using Topic Modeling, In: 2019 IEEE 11th International Conference On Humanoid, Nanotechnology, Information Technology, Communication And Control, Environment, And Management ( Hnicem ). Presented At The 2019 Ieee 11th International Conference On Humanoid, Nanotechnology, Information Technology, Communication And Control, Environment, And Management, IEEE, Laoag, Philippines, pp.1-4.


Sethasathien, N., Prasertsom, P., 2020. Research Topic Modeling: A Use Case For Data Analytics On Research Project Data, In: 2020 1st International Conference On Big Data Analytics And Practices (Ibdap). Presented At The 2020 1st International Conference On Big Data Analytics And Practices (Ibdap), pp. 1–6.


How to Cite
Wiranto, W., & Mila Rosyida Uswatunnisa. (2022). Topic Modeling for Support Ticket using Latent Dirichlet Allocation . Jurnal RESTI (Rekayasa Sistem Dan Teknologi Informasi), 6(6), 998 - 1005.
Artikel Teknologi Informasi