Informatica logo


Login Register

  1. Home
  2. To appear
  3. Ontology-Based Knowledge Graph Approach ...

Informatica

Information Submit your article For Referees Help ATTENTION!
  • Article info
  • Full article
  • Related articles
  • More
    Article info Full article Related articles

Ontology-Based Knowledge Graph Approach for Legal Queries
Vuong T. Pham   Dang V. Dung   Hung Q. Ngo   Binh Nguyen   Nhon V. Do   Ali Selamat   Hien D. Nguyen ORCID icon link to view author Hien D. Nguyen details  

Authors

 
Placeholder
https://doi.org/10.15388/25-INFOR617
Pub. online: 5 January 2026      Type: Research Article      Open accessOpen Access

Received
1 October 2024
Accepted
1 December 2025
Published
5 January 2026

Abstract

In the legal domain, ontologies organize legal concepts and their relationships, while knowledge graphs connect these concepts to specific entities in legal documents. This study proposes a solution for integrating ontology and knowledge graph, called Legal-Onto model, to construct a knowledge base of an intelligent retrieval system in the legal domain. The Legal-Onto model combines ontology as the conceptual layer and knowledge graphs as the implementation layer for representing the content of legal documents. This relational model is integrated with a structure of knowledge graph to identify relations between concepts and entities extracted from ontology in the determined domain. Moreover, this research addresses inherent challenges in semantic-based knowledge-driven search. The specific objective is to accurately extract relevant information from legal documents to respond to entered queries. The experimental results show that this method is more effective than state-of-the-art methods in natural language processing and large language models, which are without specific legal domain knowledge.

References

 
Abu-Salih, B., Alotaibi, S. (2024). A systematic literature review of knowledge graph construction and application in education. Heliyon, 10(3), 25383.
 
Ajevski, M., Barker, K., Gilbert, A., Hardie, L., Ryan, F. (2023). ChatGPT and the future of legal education and practice. The Law Teacher, 57(3), 352–364.
 
Allison, N.G. (2023). From semantic weight to legal ontology via classification of concepts in legal texts. The Law Teacher, 57(2), 201–217.
 
Casillas, J., Cordón, O., Triguero, F.H., Magdalena, L. (2013). Interpretability Issues in Fuzzy Modeling. Springer-Verlag, Berlin Heidelberg.
 
Chakraborty, C., Khan, M. (2023). Big data-driven futuristic fabric system in societal digital transformation. Big Data, 11(5), 321–322.
 
Dang, D., Nguyen, H., Le, T., Do, H., Nguyen, H., Ngo, H.Q., Nguyen, H.D. (2025). Knowledge graph-based legal query system with LLM and retrieval augmented generation. In: Proceedings of the 17th Asian Conference on Intelligent Information and Database Systems (ACIIDS 2025), Part 1, pp. 161–172.
 
De Cruz, P. (2024). Comparative Law in a Changing Word. Taylor & Francis, London.
 
Delvin, J., Chang, M.W., Kenton, L., Toutanova, K. (2019). BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of NAACL-HLT 2019, pp. 4171–4186.
 
Desai, D., Ghadge, A., Wazare, R., Bagade, J. (2021). A comparative study of information retrieval models for short document summaries. In: Proceedings of Fourth International Conference on Computer Networks and Inventive Communication Technologies (ICCNCT 2021), pp. 547–562.
 
Dhani, J.S., Bhatt, R., Balaji, G., Sirohi, P., Bhatnagar, V. (2023). Similar cases recommendation using legal knowledge graphs. In: Proceedings of 3rd Symposium on Artificial Intelligence and Law (SAIL 2023), pp. 1–12.
 
Ding, Y., Li, H., Zhu, F., Wang, Z., Peng, W., Xie, M., (2023). A semi-supervised failure knowledge graph construction method for decision support in operations and maintenance. IEEE Transactions on Industrial Informatics, 20(3), 3104–3114.
 
Do, N., Nguyen, H.D., Selamat, A. (2018). Knowledge-based model of expert systems using Rela-model. International Journal of Software Engineering and Knowledge Engineering, 28(08), 1047–1090.
 
GoogleAI (2024). Gemini. https://gemini.google.com/ [Accessed: 08 July 2024].
 
Gunkel, D. (2023). Person, Thing, Robot: A Moral and Legal Ontology for the 21st Century and Beyond. MIT Press, Cambridge, Massachusetts.
 
Hoang, L., Bui, T., Nguyen, C., Nguyen, L.-M. (2023). AIEPU at ALQAC 2023: deep learning methods for legal information retrieval and question answering. In: Proceedings of 15th International Conference on Knowledge and Systems Engineering (KSE 2023). bpublisherIEEE, pp. 1–6.
 
Hogan, A., Blomqvist, E., Cochez, M., d’Amato, C., et al. (2021). Knowledge graphs. ACM Computing Surveys (Csur), 54(4), 71.
 
Junior, A.C., Orlandi, F., Graux, D., Hossari, M., O’Sullivan, D., Hartz, C., Dirschl, C. (2020). Knowledge graph-based legal search over German Court cases. In: Proceedings of 17th European Semantic Web Conference (ESWC 2020), pp. 293–297.
 
Khan, N., Ma, Z., Ullah, A., Polat, K. (2022). DCA-IoMT: knowledge-graph-embedding-enhanced deep collaborative alert recommendation against COVID-19. IEEE Transactions on Industrial Informatics, 18(12), 8924–8935.
 
Le, H.H., Nguyen, C.-T., Ngo, T.P., Vinh, P.V., Nguyen, B.T., Huynh, A.T., Nguyen, H.D., (2023). Intelligent retrieval system on legal information. In: Proceedings of 15th Asian Conference on Intelligent Information and Database Systems (ACIIDS 2023), pp. 97–108. Springer.
 
Le, T., Luu, S., Nguyen, H., Do, N. (2019). Knowledge representation for designing an Intelligent Tutoring System in learning of courses about algorithms. In: Proceedings of 25th Asia-Pacific Conference on Communications (APCC 2019), pp. 310–315.
 
Li, M. (2024). Adapting legal education for the changing landscape of regional emerging economies: a dynamic framework for law majors. Journal of the Knowledge Economy, 15(3), 10227–10256.
 
Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., Levy, O., Lewis, M., Zettlemoyer, L., Stoyanov, V. (2019). RoBERTa: A robustly optimized BERT pretraining approach. arXiv preprint arXiv:1907.11692.
 
Liu, Y., Han, T., Ma, S., et al. (2023). Summary of ChatGPT-Related research and perspective towards the future of large language models. Meta-Radiology, 1(2), 100017.
 
Manning, C., Raghavan, P., Schütze, H. (2009). Introduction to Information Retrieval. Cambridge University, Cambridge.
 
Maroudas, S., Legkas, S., Malakasiotis, P., Chalkidis, I. (2022). Legal-tech open diaries: lesson learned on how to develop and deploy lightweight models in the era of humongous language models. In: Proceedings of the Natural Legal Language Processing Workshop 2022, pp. 88–110.
 
Martinez-Gil, J. (2023). A survey on legal question–answering systems. Computer Science Review, 48, 100552.
 
Meta (2024). Introducing LLaMa: a foundational, 65-billion-parameter language model. https://ai.meta.com/blog/large-language-model-llama-meta-ai/ [Accessed: 08 July 2024].
 
Min, B., Ross, H., Sulem, E., et al. (2023). Recent advances in natural language processing via large pre-trained language models: a survey. ACM Computing Surveys, 56(2), 1–40.
 
Mishra, A., Vishwakarma, S. (2015). Analysis of TF-IDF model and its variant for document retrieval. In: Proceedings of 2015 International Conference on Computational Intelligence and Communication Networks (CICN 2015), pp. 772–776.
 
Mohan, D., Nair, L.R. (2022). Sammon quadratic recurrent multilayer deep classifier for legal document analytics. Computers, Materials & Continua, 72(2), 3039–3053.
 
Mökander, J., Schuett, J., Kirk, H.R., Floridi, L. (2024). Auditing large language models: a three-layered approach. AI and Ethics, 4, 1085–1115.
 
Moneus, A.M., Sahari, Y. (2024). Artificial intelligence and human translation: a contrastive study based on legal texts. Heliyon, 10(6).
 
Naseem, U., Razzak, I., Khan, S.K., Prasad, M. (2021). A comprehensive survey on word representation models: from classical to state-of-the-art word representation language models. Transactions on Asian and Low-Resource Language Information Processing, 20(5), 1–35.
 
Ngo, H., Nguyen, H., Le-Khac, N.A. (2024). Ontology knowledge map approach towards building linked data for Vietnamese legal applications. Vietnam Journal of Computer Science, 11(02), 323–342.
 
Ngo, H.Q., Nguyen, H.D., Le-Khac, N.-A. (2023). Building legal knowledge map repository with nlp toolkits. In: Proceedings of 12th Conference on Information Technology and its Applications (CITA 2023). Springer, pp. 25–36.
 
Nguyen, H.D., Do, N.V., Tran, N.P., et al. (2020). Some criteria of the Knowledge Representation method for an Intelligent Problem Solver in STEM education. Applied Computational Intelligence and Soft Computing, 2020(1), 9834218.
 
Nguyen, H.D., Tran, T.-V., Pham, X.-T., Huynh, A.T., Do, N.V. (2021). Ontology-based integration of knowledge base for building an intelligent searching chatbot. Sensors & Materials, 33(9), 3101–3121.
 
Nguyen, H.D., Huynh, H., Mai, T., et al. (2023a). Design an ontology-based model for intelligent querying system in mathematics education. Journal of Interdisciplinary Mathematics, 26(3), 449–473.
 
Nguyen, H.D., Truong, D., Vu, S., et al. (2023b). Knowledge management for information querying system in education via the combination of rela-ops model and knowledge graph. Journal of Cases on Information Technology (JCIT), 25(1), 13.
 
Nguyen, H., Pham, V., Ngo, H.Q., Huynh, A., Nguyen, B., Machado, J. (2024). Intelligent search system for resume and labor law. PeerJ Computer Science, 10, 1786.
 
Nguyen, T.-S., Nguyen, L.-M., Tojo, S., et al. (2018). Recurrent neural network-based models for recognizing requisite and effectuation parts in legal texts. Artificial Intelligence and Law, 26, 169–199.
 
Nguyen, T., Nguyen, H.D., Pham, V.T., et al. (2022). Legal-Onto: an ontology-based model for representing the knowledge of a legal document. In: Proc. of 17th International Conference on Evaluation of Novel Approaches to Software Engineering (ENASE 2022), Online, pp. 426–434.
 
Oliveira, F.d., Oliveira, J.M.P.d. (2024). A RDF-based graph to representing and searching parts of legal documents. Artificial Intelligence and Law, 32, 667–695.
 
Peng, C., Xia, F., Naseriparsa, M., Osborne, F. (2023). Knowledge graphs: opportunities and challenges. Artificial Intelligence Review, 56(11), 13071–13102.
 
Pham, V.T., Nguyen, H.D., Le, T., B., N., Ngo, H.Q. (2023). Ontology-based solution for building an intelligent searching system on traffic law documents. In: Proceedings of 15th International Conference on Agents and Artificial Intelligence (ICAART 2023), Lisbon, Portugal, pp. 426–434.
 
Robertson, S., Zaragoza, H. (2009). The probabilistic relevance framework: BM25 and beyond. Foundations and Trends in Information Retrieval, 3(4), 333–389.
 
Sansone, C., Sperlí, G. (2022). Legal information retrieval systems: state-of-the-art and open issues. Information Systems, 106, 101967.
 
Sartor, G., Araszkiewicz, M., Atkinson, K., et al. (2022). Thirty years of artificial intelligence and law: the second decade. Artificial Intelligence and Law, 30(4), 521–557.
 
Satti, M.I., Ahmed, J., Muslim, H.S.M., et al. (2023). Ontology-based news linking for semantic temporal queries. Computers, Materials & Continua, 74(2), 3913–3929.
 
Sejnowski, T. (2023). Large language models and the reverse turing test. Neural Computation, 35(3), 309–342.
 
Shao, Y., Wu, Y., Liu, Y., et al. (2023). Understanding relevance judgments in legal case retrieval. ACM Transactions on Information Systems, 41(3), 1–32.
 
Sikos, L.F. (2021). AI in digital forensics: ontology engineering for cybercrime investigations. Wiley Interdisciplinary Reviews: Forensic Science, 3(3), 1394.
 
Soni, S., Chouhan, S., Rathore, S. (2023). Textconvonet: a convolutional neural network based architecture for text classification. Applied Intelligence, 53(1), 14249–14268.
 
Sovrano, F., Palmirani, M., Vitali, F. (2020). Legal knowledge extraction for knowledge graph based question-answering. In: Legal Knowledge and Information Systems. IOS Press, Amsterdam, pp. 143–153.
 
Tang, Y., Qiu, R., Liu, Y., et al. (2024). CaseGNN: graph neural networks for legal case retrieval with text-attributed graphs. In: Proceedings of 46th European Conference on Information Retrieval (ECIR 2024). Springer, pp. 80–95.
 
Van, H.N., Nguyen, D., Nguyen, P.M., Le Nguyen, M. (2022). Miko team: deep learning approach for legal question answering in ALQAC 2022. In: Proceedings of 14th International Conference on Knowledge and Systems Engineering (KSE 2022). IEEE, pp. 1–5.
 
Vaswani, A., Shazeer, N., Parmar, N., et al. (2017). Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS 2017), pp. 6000–6010.
 
Vietnam Government (2019). Decree on Administrative penalties for road traffic and rail transport offences, No. 100/2019/ND-CP.
 
Vietnam Ministry of Transport (2019). National Technical Regulation on Traffic Signs and Signals. QCVN 41:2019/BGTVT.
 
Vietnam National Assembly (2008). Law on Road Traffic, No. 23/2008/QH12.
 
Vietnam National Assembly (2021). Amending and supplementing some articles of decrees provisions on penalties for administrative violations in the marine region, road, rail traffic, and domestic airline, No. 123/2021/ND-CP.
 
Villata, S., Araszkiewicz, M., Ashley, K., et al. (2022). Thirty years of artificial intelligence and law: the third decade. Artificial Intelligence and Law, 30(4), 561–591.
 
Vuong, Y., Hoang, Q., Nguyen, M., et al. (2023). Constructing a knowledge graph for Vietnamese legal cases with heterogeneous graphs. In: Proceedings of 15th International Conference on Knowledge and Systems Engineering (KSE 2023), Hanoi, Vietnam, pp. 1–6.
 
Wang, J., Dong, Y. (2020). Measurement of text similarity: a survey. Information, 11(9), 421.
 
Wu, Q., Yu, Y., Zhou, L., Lu, Y., Chen, H., Qian, X. (2024). Storage and query indexing methods on big data. Arabian Journal for Science and Engineering, 49(5), 7359–7374.
 
Xia, C., He, T., Li, W., Qin, Z., Zou, Z. (2019). Similarity analysis of law documents based on Word2vec. In: Proceeding of 19th IEEE International Conference on Software Quality, Reliability and Security Companion (QRS-C 2019), pp. 354–357.
 
Yoshioka, M., Suzuki, Y., Aoki, Y. (2022). HUKB at the COLIEE 2022 statute law task. In: JSAI International Symposium on Artificial Intelligence, Springer, pp. 109–124.
 
Zalesinska, A., Szostek, D., Zalucki, M. (2021). Legal Tech: Information Technology Tools in the Administration of Justice. Nomos, Baden-Baden.
 
Zouaoui, S., Rezeg, K. (2021). A novel quranic search engine using an ontology-based semantic indexing. Arabian Journal for Science and Engineering, 46(4), 3653–3674.

Biographies

Pham Vuong T.
vuong.pham@sgu.edu.vn

V.T. Pham received the BS degree in mathematics and informatics from the University of Science, VNU-HCM, Vietnam, in 2003, and the MS degree in Information Technology from University of Science, VNU-HCM, Vietnam, in 2008. He is currently a PhD student at University of Science, VNU-HCM. He is also a lecturer at Sai Gon University, Vietnam. From 2006–2019, he was a lecturer at the Faculty of Software Engineering, University of Information Technology, VNU-HCM, Vietnam. His research interests include artificial intelligence, computer science, software engineering, mathematics foundation.

Dung Dang V.
dungdv@uit.edu.vn

D.V. Dang received his MSc degree in computer science in 2023. He is currently a PhD candidate in the Department of Computer Science since 2025 and serves as a lecturer at the Faculty of Software Engineering, University of Information Technology, Vietnam National University Ho Chi Minh City, Vietnam. His research interests include knowledge representation, automated reasoning, and knowledge engineering.

Ngo Hung Q.
hung.ngo@ucd.ie

Hung Q. Ngo is a PostDoc Researcher at the School of Computer Science, University College Dublin (UCD), Ireland. He received his PhD degree in computer science from the University College Dublin in 2022 and his MSc degree in computer science from the University of Science, Vietnam National University – Ho Chi Minh City (VNUHCM), Vietnam in 2008. His research interests include knowledge management, natural language processing, and intelligent systems in cross-disciplinary domains, such as bioinformatics, digital agriculture, legal AI, and digital forensics.

Nguyen Binh
ngtbinh@hcmus.edu.vn

B.T. Nguyen received the PhD degree (Hons.) from Ecole Polytechnique, Paris, France, in 2012. He is currently the head of the Department of Computer Science and an associate professor of Computer Science with the Faculty of Mathematics and Computer Science, University of Science, Vietnam National University Ho Chi Minh City, Ho Chi Minh City, Vietnam. He has over ten years of experience in AI and data science. Up to now, he has had over 100 publications and 04 patents filed in USA and Canada. He has substantial experience building research and development teams to help companies or startups deliver AI products.

Do Nhon V.
nhondv@hiu.vn

N.V. Do received his MSc and PhD degrees in computer science from University of Science, VNU-HCM, Vietnam in 1995 and 2002, respectively. He was an Associate Professor at University of Information Technology, VNU-HCM, Vietnam, from 2006 to 2018 and at Hoa Sen University, Vietnam, from 2018 to 2019. He has been an associate professor and the head of Information Technology Department at Hong Bang International University, Vietnam, from 2020. His research interests include artificial intelligence, computer science, and their practical applications, especially intelligent systems and knowledge-based systems.

Selamat Ali
aselamat@utm.my

A. Selamat is currently a full professor with Universiti Teknologi Malaysia (UTM), Malaysia. He is a Deputy Vice Chancellor (Student Affairs & Alumni), UTM. He was the Dean of the Malaysia Japan International Institute of Technology (MJIIT), UTM, since 2018. He is also a Professor with the Software Engineering Department, Faculty of Computing, UTM, and was the Chair of the IEEE Computer Society Malaysia Section. He has published more than 120 research articles with IF JCR, with more than 2400 citations received in the Web of Science. His research interests include software engineering, software agents, web engineering, information retrievals, pattern recognition, genetic algorithms, neural networks, soft computing, collective computational intelligence, strategic management, key performance indicator, and knowledge management.

Nguyen Hien D.
https://orcid.org/0000-0002-8527-0602
hiennd@uit.edu.vn

H.D. Nguyen is currently an associate professor at the Faculty of Computer Science, University of Information Technology, VNU-HCM, Vietnam, from 2008. He was a Visiting Assistant Professor at the Computer Science Department, New Mexico State University (NMSU), USA, in 2024–2025. His research interests include knowledge representation, automated reasoning, and knowledge engineering, especially intelligent systems in education, such as intelligent problem solvers. He received the Best Paper Awards at CITA 2023, SOMET 2022, and ICOCO 2022.


Full article Related articles PDF XML
Full article Related articles PDF XML

Copyright
© 2025 Vilnius University
by logo by logo
Open access article under the CC BY license.

Keywords
knowledge-based systems knowledge graph legal domain information retrieval knowledge representation

Metrics
since January 2020
255

Article info
views

41

Full article
views

51

PDF
downloads

4

XML
downloads

Export citation

Copy and paste formatted citation
Placeholder

Download citation in file


Share


RSS

INFORMATICA

  • Online ISSN: 1822-8844
  • Print ISSN: 0868-4952
  • Copyright © 2023 Vilnius University

About

  • About journal

For contributors

  • OA Policy
  • Submit your article
  • Instructions for Referees
    •  

    •  

Contact us

  • Institute of Data Science and Digital Technologies
  • Vilnius University

    Akademijos St. 4

    08412 Vilnius, Lithuania

    Phone: (+370 5) 2109 338

    E-mail: informatica@mii.vu.lt

    https://informatica.vu.lt/journal/INFORMATICA
Powered by PubliMill  •  Privacy policy