Analysis for the implementation of an automated technical support system based on LLM and Docker
Containerized architecture for the automation of primary care through artificial intelligence
DOI:
https://doi.org/10.5281/zenodo.19889233Keywords:
Automated technical support, Large Language Models (LLM), Docker containers, Artificial Intelligence, Microservices architecture, Natural language processing, Incident management automationAbstract
This article presents the design and implementation of an automated technical support system based on a large-scale language model (LLM), deployed using Docker containers and integrated with a web interface developed in Python. The goal is to optimize technical incident management by automating first-level support. The system interprets requests in natural language, classifies tickets according to their complexity level, and generates contextualized responses. The proposed architecture allows for scalability, service isolation, and efficient maintenance. Experimental results show a significant reduction in response times and improved load distribution across support levels. Finally, limitations related to automatic classification and performance based on computational resources are discussed, proposing future improvements in specialized models and system optimization.References
Brown, T. B., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., Dhariwal, P., et al. (2020). Language models are few-shot learners. Advances in Neural Information Processing Systems, 33, 1877–1901.
Davenport, T. H., & Ronanki, R. (2018). Artificial intelligence for the real world. Harvard Business Review, 96(1), 108–116.
Freshworks. (2026). Freshdesk: AI-powered platform for modern customer service. Recuperado el 19 de marzo de 2026 de https://www.freshworks.com/freshdesk/
Géron, A. (2022). Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow (3rd ed.). O’Reilly Media.
Huang, M. H., & Rust, R. T. (2021). Artificial intelligence in service. Journal of Service Research, 24(1), 3–16. https://doi.org/10.1177/1094670520902266
ITIL. (2019). ITIL Foundation: ITIL 4 Edition. AXELOS.
Lin, C.-Y. (2004). ROUGE: A package for automatic evaluation of summaries. In Proceedings of the Workshop on Text Summarization Branches Out (pp. 74–81). Association for Computational Linguistics.
Merkel, D. (2014). Docker: Lightweight Linux containers for consistent development and deployment. Linux Journal, 2014(239), 2.
Newman, S. (2015). Building microservices. O’Reilly Media.
Norman, D. (2013). The design of everyday things. Basic Books.
OpenAI. (2023). GPT-4 technical report. arXiv:2303.08774.
Papineni, K., Roukos, S., Ward, T., & Zhu, W.-J. (2002). BLEU: A method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (pp. 311–318). Association for Computational Linguistics.
Richards, M., & Ford, N. (2020). Fundamentals of software architecture. O’Reilly Media.
Van Rossum, G., & Drake, F. L. (2009). Python 3 reference manual. CreateSpace.
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., et al. (2017). Attention is all you need. Advances in Neural Information Processing Systems, 30.
Zendesk. (2026). AI for customer service. Recuperado el 19 de marzo de 2026 de https://www.zendesk.com/service/ai/.
Agarwal, S., Sindhgatta, R., & Sengupta, B. (2012). SmartDispatch: Enabling efficient ticket dispatch in an IT service environment. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM. doi:10.1145/2339530.2339744.
Al-Hawari, F., & Barham, H. (2021). A machine learning based help desk system for IT service management. Journal of King Saud University - Computer and Information Sciences, 33(6), 702–718. doi:10.1016/j.jksuci.2019.04.001.
Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of NAACL-HLT 2019 (pp. 4171–4186). Association for Computational Linguistics.
Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, M., & Wang, H. (2024). Retrieval-augmented generation for large language models: A survey. arXiv. doi:10.48550/arXiv.2312.10997.
Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., Küttler, H., Lewis, M., Yih, W.-t., Rocktäschel, T., Riedel, S., & Kiela, D. (2020). Retrieval-augmented generation for knowledge-intensive NLP tasks. arXiv. doi:10.48550/arXiv.2005.11401.
Hu, E. J., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang, L., & Chen, W. (2021). LoRA: Low-rank adaptation of large language models. arXiv. doi:10.48550/arXiv.2106.09685.
Marcuzzo, M., Zangari, A., Schiavinato, M., Giudice, L., Gasparetto, A., & Albarelli, A. (2022). A multi-level approach for hierarchical ticket classification. In Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022) (pp. 201–214). Association for Computational Linguistics.
Olujimi, P. A., & Ade-Ibijola, A. (2023). NLP techniques for automating responses to customer queries: A systematic review. Discover Artificial Intelligence, 3, 20. doi:10.1007/s44163-023-00065-5.
Downloads
Published
How to Cite
Conference Proceedings Volume
Section
License
Copyright (c) 2026 AMAURY CASTILLO CRUZ, Emmanuel Tonatihu Juarez Velázquez

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
