Knowledge graph empowerment from knowledge learning to graduation requirements achievement

Yangrui Yang; Sisi Chen; Yaping Zhu; Hao Zhu; Zhigang Chen

doi:10.1371/journal.pone.0292903

Abstract

A deep understanding of the relationship between acquired knowledge and graduation requirements is essential for students to meet those requirements precisely and to become human resources with specific knowledge, skills and professionalism. In this paper, we define the ontology layer of a knowledge graph by analyzing in depth the relationship between graduation requirements, courses and knowledge. Following the concept of Outcome Based Education, we use knowledge extraction, fusion and reasoning techniques to construct a hierarchical knowledge graph with the main line of "knowledge-course-graduation requirements". To alleviate the heavy labor overhead of traditional extraction methods, we adopt a transfer learning method and extract knowledge triples with the multi-task framework EERJE. Finally, knowledge reasoning is performed with the help of an LLM to further expand the knowledge scope. The comprehensiveness, correctness and relatedness of the data were evaluated experimentally: the F1 value of triple extraction was 87.76%, the accuracy of entity classification was 85.42%, and the data coverage was judged to be comprehensive, indicating good data quality. A knowledge graph constructed in this way can help optimize the organization and management of teaching resources and help students grasp, intuitively and comprehensively, the correlations and differences between graduation requirements and individual knowledge points. Students can carry out personalized, independent learning through the navigation mode of the knowledge graph, strengthen their weak links, and complete the relevant graduation requirements, which improves their degree of graduation requirements achievement. This new paradigm of knowledge-graph-enabled teaching is of reference significance for engineering education majors seeking to improve the degree of graduation requirements achievement.

Citation: Yang Y, Chen S, Zhu Y, Zhu H, Chen Z (2023) Knowledge graph empowerment from knowledge learning to graduation requirements achievement. PLoS ONE 18(10): e0292903. https://doi.org/10.1371/journal.pone.0292903

Editor: Sathishkumar Veerappampalayam Easwaramoorthy, Sunway University, MALAYSIA

Received: June 4, 2023; Accepted: September 30, 2023; Published: October 12, 2023

Copyright: © 2023 Yang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: Data is available at https://github.com/aichiroudemao/database.

Funding: This research was funded by the Henan Provincial Higher Education Teaching Reform Research and Practice Project (2021SJGLX017), and the North China University of Water Resources and Electric Power Education Reform Project. They played a role in the decision to publish this step.

Competing interests: The authors declare no competing interests.

Introduction

Knowledge is an essential foundational component of education and teaching, and it is usually learned through specific learning activities. These learning activities are, in turn, designed to achieve graduation requirement indicators. The graduation requirement attainment system in colleges and universities requires that each course and each knowledge unit closely support, and together comprehensively cover, the graduation requirements [1], so that all specific knowledge points jointly guide students from knowledge learning to graduation requirement attainment. However, the knowledge in existing syllabi, instructional designs, and web pages about training programs is usually presented independently as unstructured text. Redundant descriptions and information overload make it easy for students to miss key information and leave them with an unclear perception of the connection between knowledge and graduation requirements. As a result, students may not understand the practical use and value of what they learn, and may find it difficult to identify the specific steps needed to achieve the graduation requirements. For example, when engineering management students learn the knowledge point "original voucher" in the course "Accounting" without knowing the graduation requirement "engineering knowledge", they may only superficially understand that an original voucher is the initial source for recording financial transactions; they cannot relate it to the cost and quality of engineering, and so cannot precisely and deeply reach the graduation requirement "engineering knowledge". Therefore, it is urgent and necessary to comprehensively construct the connection between knowledge units and graduation requirements.

The UN Summit on Transforming Education recently released the Initiative for Action to Ensure and Improve the Quality Public Digital Learning for All [2], which states that countries can take full advantage of digital technologies to empower teaching and learning. As a new generation of information technology, the knowledge graph is a powerful tool for computer-supported knowledge organization and visual presentation. Existing generative AI shows randomness when answering questions because of the breadth of its training data, so it can easily generate answers that are unrelated to the current question and needs to be constrained. In contrast, knowledge graphs can extract structured information from target documents and thereby provide structured knowledge support for applications such as retrieval and Q&A. Knowledge graphs also contain explicit inter-entity relationships, which help machines better understand semantics and thus improve the precision and accuracy of information processing. In addition, when answering questions in vertical domains, generative AI may introduce noise from other domains, whereas a knowledge graph can provide regular and accurate professional information for downstream applications and more easily meets the specialized needs of vertical domains. Therefore, to help students understand the connection between various types of knowledge and graduation requirements more intuitively and comprehensively, this paper constructs a hierarchical knowledge graph with the main line of "knowledge-course-graduation requirements" to support students in moving from knowledge learning to the attainment of graduation goals. Specifically, this paper analyzes teaching resources such as training programs and syllabi to clarify the specific ability and knowledge requirements covered by the graduation requirements.
It breaks down the graduation requirement indicators, summarizes the relationship between graduation requirements, courses, and knowledge, and defines the ontology layer of the knowledge graph. Entity relationship extraction is one of the key technologies in constructing the graph. It is an information extraction method based on natural language processing and machine learning, which identifies and extracts entities and relationships from unstructured or semi-structured text and represents them as structured data in the form of triples and knowledge graphs. Existing extraction methods based on summary sentence types [3–5] or sequence annotations [6–8] often carry a heavy labor overhead. To alleviate this overhead, this paper uses APICKnow [9] to define an Education Entity Relationship Joint Extractor (EERJE) for extracting knowledge triples. With the graph database Neo4j as the knowledge storage carrier, the triples are presented in the form of a knowledge graph. By connecting related entities, the graph forms a complete knowledge structure. This structure can help students understand more intuitively the abilities and knowledge that the graduation requirements demand, and can help them clarify their current learning progress and future learning goals, thereby enhancing their intrinsic motivation for learning. The knowledge graph can guide students to master each knowledge unit step by step and so achieve the graduation requirements precisely, improving their degree of graduation goal attainment.

The main work of this paper includes:

We analyze the content characteristics of heterogeneous data sources and define clear entity types and inter-entity relationship types based on the main line of "knowledge-course-graduation requirements" to build the ontology layer of the knowledge graph.
We use EERJE, a joint entity-relationship extraction framework, to extract knowledge triples and build a knowledge graph that supports learning from knowledge toward graduation requirements achievement.
We use an LLM for entity relationship reasoning to further expand the scope of knowledge.
We evaluate the quality and the usage effect of the graph through experiments. The results show that the graph lets students clearly understand the links and differences between the knowledge points that support graduation requirements, which facilitates knowledge sorting and learning. Students can also navigate the knowledge graph according to their own level of mastery and jump to their weak knowledge points, so that learning becomes personalized and independent. This enables students to complete the relevant graduation requirements efficiently and accurately, and thus improves the degree of graduation requirements achievement.

Related work

The knowledge graph concept developed through a sequence of stages: semantic web [10], ontology, world wide web [11], semantic web [12], and linked data [13]. It was not until May 2012 that Google officially introduced and published the concept of the Knowledge Graph [14], which is a general semantic knowledge formalization framework. It primarily uses a graph-based data model to capture knowledge in application scenarios involving integration [15], and employs visualization techniques to describe knowledge resources and their carriers and to mine, analyze, construct, draw, and display knowledge and its interconnections, providing an efficient way to organize, manage, and analyze massive amounts of data [16]. With the development of natural language processing and deep learning, the knowledge graph has become an essential tool for semantic analysis. A high-quality knowledge graph is valuable for building high-performance knowledge-driven applications [17], so knowledge graphs are being applied in many fields and have shown great application prospects.

Since knowledge graphs, as large-scale knowledge engineering, are crucial for the organization and integration of knowledge [18], entity-relationship extraction techniques can be used to extract entities, inter-entity relationships, and entity attributes from unstructured or semi-structured text and to integrate them as triples such as <head entity, relationship, tail entity> and <entity, attribute, attribute value>, so that fragmented and abstract knowledge can be displayed clearly and visually. In the field of education alone, several scholars have conducted research using knowledge graphs. For example, Chen et al. [19] extracted concepts of subjects or courses and then identified educational relationships among the concepts. Ding et al. [20] analyzed the research hotspots and evolution trends in the field of intelligent education research. Chen et al. [21] conducted an in-depth analysis of the mathematical discipline.

Although the aforementioned studies have applied knowledge graphs to subject concepts, teaching research hotspots, and the discipline of mathematics, and have used the related knowledge to construct graphs, none has yet effectively organized the content corresponding to graduation requirements. To help teachers and students better understand the content of their graduation requirements, this paper studies that problem specifically. We analyze the relevant text data in depth, associate different entities through the fusion of multi-source heterogeneous data, and build a knowledge graph with graduation requirements at its core. The graph displays the correlations and differences between each knowledge point and the graduation requirements more intuitively, helps students better understand the specific knowledge points that correspond to graduation requirements, and lets them reach the graduation requirements step by step according to the content of the knowledge graph, improving the degree of graduation requirement achievement.

Knowledge system description

Based on the OBE concept, this paper analyzes the relevant text data in depth and summarizes the main structure of "Knowledge-Course-Graduation Requirements" to help students learn the knowledge points, complete the corresponding course objectives, and acquire knowledge, skills and professionalism to meet the graduation requirements. The types and definitions of each level are shown in Table 1.

Table 1. Types included in each main line structure and related descriptions.

https://doi.org/10.1371/journal.pone.0292903.t001

Knowledge graph structure

The data sources of the knowledge graph constructed in this paper are syllabi, teaching designs, web knowledge about training programs, etc., which are mainly unstructured and semi-structured text corpora. In terms of knowledge coverage and mining depth, the graph is a vertical domain knowledge graph. Vertical domain knowledge graphs are oriented to a fixed domain, mine the knowledge of that domain more deeply, and require higher-quality data. For these two reasons, this paper adopts a top-down approach to construct the knowledge graph; the overall construction process is shown in Fig 1. First, data are obtained from syllabi, teachers’ lesson plans, and web pages about the relevant training programs. The data are filtered for redundancy by removing paragraphs that are irrelevant to graduation requirements, course objectives, and knowledge units, and the remaining paragraphs are split into sentences using PyLTP [22] with the period as the separator. On this basis, the ontology structure of the knowledge graph is constructed to model and standardize the content related to graduation requirements. Second, manually annotated text data are used to train the joint extraction framework EERJE, which then extracts entities and inter-entity relationships from the remaining text data. Third, techniques such as entity linking are used to resolve the problem of multiple words with one meaning in the unstructured and semi-structured text data, and the structured text data are merged with relational databases. Fourth, an LLM is used for knowledge reasoning to further expand the data range. Finally, the widely used graph database Neo4j [23] stores the data layer of the knowledge graph.

Fig 1. Overall build process.

https://doi.org/10.1371/journal.pone.0292903.g001

Ontology frame design

The most important aspect of constructing the knowledge graph is the design of the ontology, which formally represents the hierarchical relationships between entities [24]. The design of the ontology structure determines whether the graph can be applied effectively in this domain. Therefore, this paper defines a total of 10 entity types and 11 relationship types based on the description of the knowledge system and under the guidance of professional teachers. The ontology frame design is shown in Fig 2.

Fig 2. Ontology frame design.

https://doi.org/10.1371/journal.pone.0292903.g002

Knowledge extraction

Data processing.

In this paper, we mainly use publicly available syllabi, teachers’ lesson plans, and the content of web pages about training programs as data sources. After obtaining the data, the first step is paragraph screening, which filters out redundant descriptions that are unrelated to graduation objectives, the course system, and knowledge units. PyLTP is then used to split the remaining paragraphs into sentences, with the period as the separator. After manual annotation, the processed text data are input to the joint extraction framework EERJE for training, so that it can extract entities and inter-entity relationships.
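
The screening and sentence-splitting steps can be illustrated with a minimal sketch. It is not the authors' code: the keyword list is hypothetical (the paper does not state its screening criteria), and a standard-library regular expression stands in for the PyLTP sentence splitter.

```python
import re

# Hypothetical screening keywords; the paper does not list its actual criteria.
KEYWORDS = ("graduation requirement", "course objective", "knowledge unit")

def filter_paragraphs(paragraphs, keywords=KEYWORDS):
    """Keep only paragraphs that mention at least one target keyword."""
    return [p for p in paragraphs if any(k in p.lower() for k in keywords)]

def split_sentences(paragraph):
    """Split after each Chinese or Western period (naive: also splits decimals)."""
    parts = re.split(r"(?<=[。.])\s*", paragraph)
    return [s.strip() for s in parts if s.strip()]

paragraphs = [
    "Course objective 1 supports graduation requirement 3. It covers cost control.",
    "Office hours are on Monday.",
]
kept = filter_paragraphs(paragraphs)
sentences = [s for p in kept for s in split_sentences(p)]
```

In practice a dedicated splitter such as the one in PyLTP handles abbreviations and numerals more robustly than this regular expression.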

Entity relationship extraction.

The multitasking framework EERJE used in this paper works as follows: a dynamic prompt generator, combined with a predefined schema, generates a dynamic prompt sequence for the input text, and this sequence guides a joint entity-relationship extractor to extract both entities and inter-entity relationships from the input text. The overall processing flow is shown in Fig 3.

Fig 3. EERJE overall processing flow.

https://doi.org/10.1371/journal.pone.0292903.g003

The multitasking framework EERJE can be divided into three parts: the dynamic prompt generator, the dynamic prompt sequence, and the Entity-Relationship Joint Extractor.

I Dynamic prompt generator. The core of the dynamic prompt generator is a BERT-based text classifier [25], which takes the input text and predicts the relation types that may exist in the sentence, producing a list of candidate relations. This list should contain no more than three candidates, because the entity-relationship joint extractor extracts the entity-relationship triples in a sentence based on the dynamic prompt sequence produced by the dynamic prompt generator. A sentence generally does not contain all relationship types at once, and the joint extractor receives more noise as the number of relationship types in the prompt increases. To avoid this interference, the dynamic prompt generator narrows the prompt in advance by outputting only the three most likely relationship types.

II Dynamic prompt sequence. The dynamic prompt sequence is composed of entity types and relationship types combined with the current input text data in the form of [spot], [asso], [text], where [spot] refers to the entity type, [asso] refers to the relationship between entities, and [text] refers to the input text.

III Entity-Relationship Joint Extractor. The Entity-Relationship Joint Extractor is a sequence-to-sequence framework, and the overall generation process can be expressed by Eq (1):

$$[y_1, y_2, \ldots, y_{|Y|}] = \mathrm{ERJE}\left([I_1, I_2, \ldots, I_{|I|}]\right) \tag{1}$$

Here, ERJE refers to the entity relationship joint extractor, which is implemented based on T5 [26], a Transformer [27] model; $[I_1, I_2, \ldots, I_{|I|}]$ denotes the dynamic prompt sequence and $[y_1, y_2, \ldots, y_{|Y|}]$ denotes the generated sequence. That is, the dynamic prompt sequence I produced by the dynamic prompt generator is input to T5, and T5 generates the sequence Y, from which the knowledge triples of entities and inter-entity relationships are obtained.
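
To make the first two parts concrete, the following sketch selects the three most likely relation types from classifier scores and serializes a prompt with the [spot], [asso] and [text] markers. It is an illustration under assumptions: the scores are invented, the relation names other than "belongs to" are hypothetical, and the exact serialization used by EERJE may differ.

```python
def top_k_relations(scores, k=3):
    """Return the k relation types with the highest classifier scores."""
    ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    return [rel for rel, _ in ranked[:k]]

def build_prompt(entity_types, relation_types, text):
    """Serialize entity types, relation types and the input text into one sequence."""
    spots = " ".join(f"[spot] {t}" for t in entity_types)
    assos = " ".join(f"[asso] {r}" for r in relation_types)
    return f"{spots} {assos} [text] {text}"

# Invented scores standing in for the output of the BERT-based classifier.
scores = {"belongs to": 0.81, "prerequisite": 0.07, "supports": 0.64, "uses textbook": 0.35}
relations = top_k_relations(scores)
prompt = build_prompt(
    ["course", "knowledge unit"],
    relations,
    "The original voucher is taught in Accounting.",
)
```

The resulting string plays the role of the sequence I in Eq (1); the sequence-to-sequence model itself is omitted here.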

Knowledge extraction results. In this paper, we take texts such as syllabi, instructional designs, and webpage knowledge data about training programs as the extraction corpus, and use EERJE to extract entity-relationship triples from this corpus. The cumulative number of entity relationship triples extracted is shown in Table 2.

Table 2. Entity relationship extraction results.

https://doi.org/10.1371/journal.pone.0292903.t002

Knowledge fusion

In order to improve the quality of knowledge base construction [28], knowledge fusion of the extracted entities is required, mainly to solve two problems.

I Unifying the representation of the same entity in heterogeneous texts from multiple sources. Because different data sources describe the same entity under different conditions, an entity may have multiple representations, which is the phenomenon of multiple words with one meaning. Coreference resolution is needed to standardize the entity representation. For example, the entity "Engineering Management major" is extracted from the teaching outline, while the entity "EngMgmt" is extracted from the teacher’s lesson plan. These are two expressions of the same "Engineering Management major" and need to be merged through a fusion operation. To address this phenomenon, machine learning algorithms can be employed: a model is built from training data and determines whether two entities can be fused based on the similarity of their attributes. Compared with traditional rule-based or similarity-based methods, using machine learning for entity merging does not require rules or weights to be set manually.

II Resolving entity ambiguity. In the data sources, one entity representation may have two different meanings, which is the phenomenon of one word with multiple meanings and requires entity disambiguation. For example, "Introduction to Networks" can refer to both "Computer Networks" and "Social Networks". These two interpretations belong to different subject areas: one in computer science and the other in social sciences. To address this phenomenon, we first provide a list of target entities and set rules to filter out unlikely targets, which yields a set of candidate entities. Then, an entity linking method is employed to determine, within the candidate set, the true target entity that a mention refers to [29].
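
The attribute-similarity idea behind entity fusion can be sketched as follows. This is a simplified stand-in, not the trained model the text describes: it uses a fixed Jaccard threshold, and the attribute strings and the threshold value are hypothetical.

```python
def jaccard(a, b):
    """Jaccard similarity between two attribute collections."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def should_fuse(entity_a, entity_b, threshold=0.5):
    """Propose fusing two entity records when their attributes overlap enough."""
    return jaccard(entity_a["attributes"], entity_b["attributes"]) >= threshold

# Hypothetical records; attribute strings are invented for illustration.
outline_entity = {
    "name": "Engineering Management major",
    "attributes": {"type:major", "course:Engineering Economics", "course:Engineering Cost"},
}
lesson_plan_entity = {
    "name": "EngMgmt",
    "attributes": {"type:major", "course:Engineering Economics",
                   "course:Engineering Cost", "course:Accounting"},
}
unrelated_entity = {"name": "Computer Networks", "attributes": {"type:course"}}

fuse_same = should_fuse(outline_entity, lesson_plan_entity)   # overlap 3 of 4 attributes
fuse_other = should_fuse(outline_entity, unrelated_entity)    # no shared attributes
```

A learned classifier would replace the fixed threshold with a decision boundary fitted to labeled entity pairs.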

Knowledge reasoning

In order to further expand the scope of the data, this paper uses the knowledge graph constructed around graduation requirements as the neural knowledge base, and lets a large language model (e.g., GPT-3.5) learn the knowledge contained in the knowledge graph and reason out more entity relationships related to the hierarchy of "Knowledge-Course-Graduation Requirements" through three modules. The three modules are: Entity Extractor, Entity Knowledge Parser and Entity Relationship Decision.

Entity extractor module

The Entity extractor module consists of an entity extraction unit and an entity pairing unit. Given a text, the LLM first extracts entities of the 10 types defined in this paper; the extracted entities are then paired two by two in the form (entity_1, entity_2), and the entity pairs are output. An extraction case and its output are shown in Fig 4.

Fig 4. Entity extractor module.

https://doi.org/10.1371/journal.pone.0292903.g004
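
The entity pairing unit can be illustrated with a short sketch. It assumes that pairs are unordered and that duplicate mentions are collapsed first; the paper does not state either choice, and the entity list is illustrative.

```python
from itertools import combinations

def pair_entities(entities):
    """Form all two-by-two (entity_1, entity_2) pairs from the extracted entities."""
    unique = list(dict.fromkeys(entities))  # drop duplicates, keep first-seen order
    return list(combinations(unique, 2))

# Entities as they might come back from the LLM extraction step.
extracted = ["Accounting", "original voucher", "engineering knowledge", "Accounting"]
pairs = pair_entities(extracted)
```

With n distinct entities this produces n(n-1)/2 pairs, so the number of downstream LLM calls grows quadratically with the entities found in a text.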

Entity knowledge parser module

The Entity knowledge parser module consists of an Entity Knowledge Mining unit and an Entity Knowledge Paring unit. Given an entity pair, the LLM mines the knowledge related to each entity in the pair and then combines the two pieces of knowledge into one complete, integrated set of paragraphs. Because the module needs to reason about 11 types of relationships, a separate knowledge parser is designed for each relationship type, and the parsers mine the related knowledge in parallel. To mine the knowledge relevant to each relationship type more effectively, the prompts are tailored to that type. Taking the relationship type "belongs to" as an example, the prompts are designed as "what knowledge units does the entity contain?" and "in which courses does the entity appear?". The other relationship types are handled in the same way, with the corresponding prompts substituted. An example input and output are shown in Fig 5.

Fig 5. Entity knowledge parser module.

https://doi.org/10.1371/journal.pone.0292903.g005
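
The per-relation prompt design can be sketched as a lookup of templates that are filled in for each entity of a pair. Only the two "belongs to" prompts come from the text; the dictionary name, the function name and the template placeholder are hypothetical, and the LLM call itself is left out.

```python
# Prompt templates per relationship type; only "belongs to" is given in the paper.
PARSER_PROMPTS = {
    "belongs to": [
        "What knowledge units does {entity} contain?",
        "In which courses does {entity} appear?",
    ],
}

def build_parser_prompts(relation, entity_pair):
    """Fill every template of one relationship type for both entities of a pair."""
    templates = PARSER_PROMPTS[relation]
    return [t.format(entity=e) for e in entity_pair for t in templates]

prompts = build_parser_prompts("belongs to", ("Accounting", "original voucher"))
```

The answers to these prompts would then be merged into the integrated paragraphs that the decision module receives.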

Entity relationship decision module

The Entity relationship decision module consists of three relationship decision units and a result combination unit. Given entity pairs and the related knowledge, the three decision units independently evaluate the inter-entity relationship from different perspectives and in different styles (direct questioning, judging statements to be true or false, and formulating options). The result combination unit then aggregates the outputs of the three decision units and finalizes the inter-entity relationship through majority voting. To improve accuracy, a separate relationship decision maker is again designed for each of the 11 relationship types, and these run in parallel. In addition, because the names of these 11 relationship types are polysemous, a single word may cause ambiguity and lead the LLM to generate invalid responses. For example, if the relationship type "belongs to" is associated with different knowledge in different domains, the LLM may generate an ambiguous and invalid answer such as "they have a certain relationship with each other" instead of a direct "yes" or "no" judgment. For this reason, this paper replaces each relationship name with a description of that relationship in each unit prompt; for example, "belongs to" can be described as "a knowledge unit belongs to a course". The other relationship types are likewise replaced with specific explanations. The specific output is shown in Fig 6.

Fig 6. Entity relationship decision module.

https://doi.org/10.1371/journal.pone.0292903.g006
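
The result combination unit can be sketched as a majority vote over the three yes/no answers. This is a minimal illustration; treating any answer other than "yes" or "no" as an abstention is an assumption, not something the paper specifies.

```python
from collections import Counter

def combine_decisions(decisions):
    """Return 'yes' or 'no' if it wins a strict majority of the votes, else None."""
    votes = Counter(d.strip().lower() for d in decisions)
    answer, count = votes.most_common(1)[0]
    if answer in ("yes", "no") and count > len(decisions) / 2:
        return answer
    return None  # no valid majority, e.g. too many ambiguous answers

# One answer per decision unit: direct question, true/false statement, options.
accepted = combine_decisions(["Yes", "no", "yes"])
rejected = combine_decisions(["No", "no", "yes"])
undecided = combine_decisions(["yes", "no", "they have a certain relationship"])
```

Returning None for an undecided vote leaves the caller free to drop the candidate triple or to query the model again.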

Graph database storage

Neo4j uses a graph data model, scales well, and can handle complex relational data and large data volumes. It represents data as nodes and edges, a storage structure that is close to how relationships are described in the real world, which makes querying related information in Neo4j convenient. In addition, Neo4j supports complex relational queries, full-text indexing and advanced search, making it easier to query and analyze the knowledge graph. Because the later expansion and application of the knowledge graph require complex semantic search as well as storage of large amounts of data, this paper uses Neo4j for data layer storage.

The entity relationship triples extracted by the joint extraction framework EERJE are imported into Neo4j. As an example, many entities and their relationships related to the engineering management major, such as graduation requirements, graduation requirement index points, course names, prerequisite courses, course objectives, textbook names, experiment publishers, etc., are shown in Fig 7 to demonstrate the specific storage form. The major consists of the following courses: Engineering Economics, Engineering Cost, Introduction to Engineering Graphics B, and Engineering Project Management. Among them, Engineering Project Management is a prerequisite course for both Engineering Economics and Engineering Cost. Similar relationships exist among other courses, and these prerequisite relationships establish the connections between courses [30]. In addition, graduation requirements are linked to graduation requirement index points, graduation requirement index points are linked to course names, and course names are linked to textbook names, experiments, knowledge units, etc. By connecting entities to relationships and relationships to entities, the relevant content of the entire program is related, forming a complete structural system from specific knowledge points to graduation requirements.

Fig 7. Engineering management professional entity relationship storage form.

https://doi.org/10.1371/journal.pone.0292903.g007
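
As a sketch of how one extracted triple might be written to Neo4j, the function below builds a parameterized Cypher MERGE statement. The node label `Course`, the relationship type `PREREQUISITE_OF` and the `name` property are hypothetical choices for illustration; the paper does not give its schema names or import code.

```python
import re

# Cypher cannot parameterize labels or relationship types, so validate them.
_IDENT = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*$")

def triple_to_cypher(head_label, rel_type, tail_label):
    """Build a MERGE statement that creates both nodes and the edge if missing."""
    for ident in (head_label, rel_type, tail_label):
        if not _IDENT.match(ident):
            raise ValueError(f"unsafe identifier: {ident!r}")
    return (
        f"MERGE (h:{head_label} {{name: $head}}) "
        f"MERGE (t:{tail_label} {{name: $tail}}) "
        f"MERGE (h)-[:{rel_type}]->(t)"
    )

query = triple_to_cypher("Course", "PREREQUISITE_OF", "Course")
params = {"head": "Engineering Project Management", "tail": "Engineering Economics"}
# With the official Neo4j Python driver this could be executed roughly as
# session.run(query, **params); the connection setup is omitted here.
```

MERGE is used instead of CREATE so that re-importing the same triple does not duplicate nodes or relationships.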

Knowledge graph evaluation and analysis

To construct a high-standard knowledge graph, it is necessary to assess both its data quality and its usage effect. For quality, Chen et al. [31] developed a practical framework for the quality assessment of knowledge graphs, which summarizes the basic quality requirements for each of five common categories of knowledge graph-based applications, with different evaluation indexes for each category. For vertical domain knowledge graphs, Nguyen et al. [32] proposed a detailed assessment of data quality that mainly considers correctness, relatedness, and comprehensiveness. To validate the quality of the knowledge graph constructed in this paper, we evaluate it along these three dimensions. To assess the effectiveness of its use, we use a questionnaire in which students evaluate themselves, in order to determine whether the knowledge graph allows students to personalize their learning and so improve the degree of graduation requirement attainment.

Assessment of data quality

The assessment of data quality is mainly done in terms of correctness, relatedness, and comprehensiveness, where correctness means that the extracted entities/triples are accurate, relatedness means that the extracted entities should be related to the defined types, and comprehensiveness means the information covered by the constructed graph.

For the assessment of correctness, this paper takes the manually labeled initial data and, following common corpus-splitting practice, divides it into training, validation and test sets in the ratio 8:1:1. Each type of relation triple in each sentence is then tested against the labels, with precision (P), recall (R) and F1 as the indicators of extraction performance:

$$P = \frac{TP}{TP + FP} \tag{2}$$

$$R = \frac{TP}{TP + FN} \tag{3}$$

$$F1 = \frac{2 \times P \times R}{P + R} \tag{4}$$

In the above formulas, TP denotes the number of positive samples predicted as positive, FP denotes the number of negative samples predicted as positive, FN denotes the number of positive samples predicted as negative, and F1 is the harmonic mean of P and R. The specific results are shown in Table 3.

Table 3. Results for each type of triple.

https://doi.org/10.1371/journal.pone.0292903.t003
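
The three indicators can be computed directly from the counts of true positives, false positives and false negatives. The sketch below uses invented counts, not values from Table 3.

```python
def precision_recall_f1(tp, fp, fn):
    """Compute P, R and F1 from true-positive, false-positive and false-negative counts."""
    p = tp / (tp + fp) if tp + fp else 0.0
    r = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * p * r / (p + r) if p + r else 0.0
    return p, r, f1

# Invented counts for one relation type.
p, r, f1 = precision_recall_f1(tp=90, fp=10, fn=30)
```

The zero guards simply return 0.0 when a relation type has no predictions or no gold triples, which avoids a division by zero on rare types.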

For the assessment of relatedness, this paper first uses a sampling method [33] to select 10% of the extracted entities, so that the indicators observed in the sample fall within a certain confidence interval and can be generalized to the whole population at a certain confidence level. Second, three undergraduate students majoring in education who did not participate in this experiment were invited. Two of them annotated whether each extracted entity is related to the entity types defined in this paper, and the third made the final decision when the two annotations conflicted. The Kappa coefficient of the annotations [34] is 0.91, indicating that the annotation results are largely consistent. Finally, the annotated data are used to train a BERT-based classifier; the detailed results are shown in Table 4.


Table 4. Classification results.

https://doi.org/10.1371/journal.pone.0292903.t004
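The annotation procedure described above (two annotators, a third who resolves disagreements, and a Kappa coefficient) can be sketched as follows. The sketch assumes Cohen's kappa for two annotators with binary labels (1 = related to a defined entity type, 0 = not related); the paper does not specify which kappa variant was computed. The function names and the label lists are hypothetical.

```python
from collections import Counter
from typing import List, Sequence


def cohen_kappa(a: Sequence[int], b: Sequence[int]) -> float:
    """Cohen's kappa for two annotators labelling the same items."""
    if len(a) != len(b) or not a:
        raise ValueError("label sequences must be non-empty and of equal length")
    n = len(a)
    observed = sum(x == y for x, y in zip(a, b)) / n
    count_a, count_b = Counter(a), Counter(b)
    # Chance agreement: product of the two annotators' label frequencies.
    expected = sum(count_a[k] * count_b[k] for k in set(a) | set(b)) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used a single identical label
    return (observed - expected) / (1 - expected)


def adjudicate(a: Sequence[int], b: Sequence[int], c: Sequence[int]) -> List[int]:
    """Keep the label when a and b agree; otherwise take the third annotator's label."""
    return [x if x == y else z for x, y, z in zip(a, b, c)]


# Hypothetical labels for ten sampled entities (assumption, not the paper's data).
annotator_1 = [1, 1, 1, 0, 0, 1, 0, 1, 1, 0]
annotator_2 = [1, 1, 1, 0, 0, 1, 0, 1, 0, 0]
annotator_3 = [1, 1, 1, 0, 0, 1, 0, 1, 1, 0]

kappa = cohen_kappa(annotator_1, annotator_2)
final_labels = adjudicate(annotator_1, annotator_2, annotator_3)
print(round(kappa, 2), final_labels)
```

In this toy example the two annotators agree on nine of ten items and chance agreement is 0.5, giving a kappa of 0.8. The adjudicated labels are what would then serve as training data for the classifier.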

To assess comprehensiveness, this paper invited six teachers of the engineering management major, who have an average of 12 years of teaching experience, to assess the scope covered by the entity types, the relationship types, and the extracted triples. Domain teachers were chosen because the knowledge graph is built for the graduation requirements of the engineering management major and is intended as a reference for other engineering education majors that wish to improve graduation requirement achievement through teaching empowered by the knowledge graph. For the entity types and relationship types, five teachers agreed that the types defined in this paper cover the content required in actual teaching and learning. For the content of the extracted triples, four teachers considered that the knowledge in the syllabus, the teaching design, and the related cultivation webpages is represented in a more comprehensive and structured way, which can provide students with an intuitive and comprehensive visualized knowledge graph.

Data quality assessment analysis

For correctness, the results in Table 3 show that the mean values of P, R, and F1 reached 87.07%, 88.34%, and 87.76% respectively, which indicates that triple extraction is highly correct. For relatedness, the results in Table 4 show that the Accuracy scores are above 0.80, indicating that the extracted entities can be regarded as belonging to the correct entity types and are therefore highly related. For comprehensiveness, the combined views of the six teachers suggest that the data cover the required content well. Taken together, these results indicate that the quality of the data is good.

Use effect evaluation

This paper takes the engineering management major of North China University of Water Resources and Electric Power as an example. It collects the examination scores, experimental scores, and regular scores of the students of the major before and after they used the knowledge graph, calculates the students' course goal achievement degree, and then weights the results to obtain the achievement degree of each graduation requirement. The course goal achievement degree is a weighted combination of the examination, experimental, and regular scores, and it is the basis of the graduation requirement achievement degree. The calculation is shown in Eq (5), where A is the regular grade, B is the examination grade, and C is the experimental grade, all on a percentage scale.

(5)

The achievement degree of a graduation requirement is aggregated from the achievement degrees of all the courses that support it. Suppose that the nth graduation requirement has a total of H index points and is supported by i courses. If the goal achievement degree of the ith course is S_i and the corresponding weight for index point H_u of the nth graduation requirement is W_x, then the achievement degree of index point H_u of the nth graduation requirement is given by Eq (6).

(6)

The achievement degree H_n of the nth graduation requirement is the sum of the achievement degrees H_u of its index points, each multiplied by the weight W_l, as shown in Eq (7).

(7)

The overall graduation requirement attainment degree is the sum of the n graduation requirement attainment degrees, each multiplied by the weight W_p, as shown in Eq (8).

(8)
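The layered weighting described for Eqs (5)-(8) can be sketched as a chain of weighted sums: scores to course achievement, courses to index points, index points to graduation requirements, and requirements to the overall attainment. This is only a sketch under stated assumptions. The weights (including the 0.2/0.6/0.2 split of regular, examination, and experimental grades), the division by 100 to obtain a value between 0 and 1, the requirement that each group of weights sums to 1, and all course, index point, and requirement names are hypothetical; the paper's actual coefficients are not reproduced here.

```python
from typing import Mapping


def weighted_sum(values: Mapping[str, float], weights: Mapping[str, float]) -> float:
    """Sum values[k] * weights[k]; the weights are assumed to sum to 1."""
    if abs(sum(weights.values()) - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    return sum(values[k] * w for k, w in weights.items())


def course_achievement(a: float, b: float, c: float) -> float:
    """Eq (5) analogue: A = regular, B = examination, C = experimental grade (0-100).

    The 0.2 / 0.6 / 0.2 weights are hypothetical.
    """
    return (0.2 * a + 0.6 * b + 0.2 * c) / 100.0


# Course goal achievement degrees S_i (hypothetical grades).
S = {
    "course_1": course_achievement(80, 70, 90),
    "course_2": course_achievement(90, 80, 70),
}

# Eq (6) analogue: index point achievement H_u from supporting courses.
H_u = {
    "point_1_1": weighted_sum(S, {"course_1": 0.5, "course_2": 0.5}),
    "point_1_2": weighted_sum(S, {"course_2": 1.0}),
    "point_2_1": weighted_sum(S, {"course_1": 1.0}),
}

# Eq (7) analogue: graduation requirement achievement H_n from its index points.
H_n = {
    "requirement_1": weighted_sum(H_u, {"point_1_1": 0.4, "point_1_2": 0.6}),
    "requirement_2": weighted_sum(H_u, {"point_2_1": 1.0}),
}

# Eq (8) analogue: overall attainment from all graduation requirements.
overall = weighted_sum(H_n, {"requirement_1": 0.5, "requirement_2": 0.5})
print(S, H_n, overall)
```

With these made-up numbers the two courses reach 0.76 and 0.80, the two requirements reach 0.792 and 0.76, and the overall attainment is 0.776. Comparing such values for students before and after using the knowledge graph is the kind of comparison reported in the next section.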

Evaluation and analysis of the effectiveness of use

The graduation requirement attainment of engineering management students who used the knowledge graph and of those who did not was calculated as described in this paper. The results for all graduation requirements are shown in Fig 8.


Fig 8. Results of graduation requirement attainment data.

https://doi.org/10.1371/journal.pone.0292903.g008

According to the results of the above data analysis, students who used the knowledge graph generally outperformed students who did not in terms of graduation requirement attainment. The main reason is that the content related to graduation requirements is scattered across texts, so it is difficult for students to grasp the connection between knowledge and graduation requirements as a whole, and they are often unclear about the preceding and following courses or the specific knowledge points that correspond to each graduation requirement. The knowledge graph, by contrast, is related, structured, and navigable. Students can find the courses and specific knowledge related to each graduation requirement intuitively and efficiently, and they can navigate through the graph to the content that addresses their weak points, so that they can personalize their learning and complete the graduation requirements efficiently and accurately. The results show that constructing a knowledge graph that relates knowledge to graduation requirements can effectively improve students' achievement of graduation requirements, and they support the necessity and effectiveness of constructing this graph.

Conclusion and prospect

This study combines the OBE concept with an in-depth analysis of texts such as syllabi, instructional designs, and web knowledge about educational content to obtain the hierarchical structure of "knowledge-course-graduation requirement", and defines 10 types of entities and 11 types of relationships. The ontology is constructed, the collected data are pre-processed and manually annotated, and the joint extraction framework EERJE is then trained to extract entities and relationships. Finally, the content of the knowledge graph is used as a neural knowledge base from which an LLM learns; the LLM then reasons over other textual data through the three modules to obtain more entity-relationship triples and further expand the data coverage. Experiments were conducted to assess the quality of the knowledge graph data and the effectiveness of its use. The results show that the data quality is good and that the knowledge graph constructed in this way integrates and visualizes the highly dispersed content of graduation requirements, portrays the correlations and differences between knowledge points and graduation requirements, and forms a complete framework. Through this knowledge navigation, students can jump to the knowledge points that match their own level of mastery, carry out personalized independent learning, strengthen their grasp of knowledge, and complete the graduation requirements step by step, which improves their degree of graduation requirement achievement.

In future work, we plan to build a search engine that supports talent development content, assists university teachers in formulating educational content with clear objectives and logic, and helps students complete their professional learning independently and acquire knowledge, skills, and professionalism, laying the foundation for cultivating high-quality talent.

References

1. Xingbin M., Ning S., Yanmin L., Fei Y., Wei W., Yingbin H., et al. (2023). Application of the integrated teaching model of theory and practice based on outcome-based education principles in clerkship teaching of digestive system. Chinese Journal of Medical Education, 43(3), 199.
2. Digital Learning for all (no date) United Nations. https://www.un.org/en/transforming-education-summit/digital-learning-all (Accessed: 30 May 2023).
3. Culotta, A., McCallum, A., & Betz, J. (2006, June). Integrating probabilistic extraction models and data mining to discover relations and patterns in text. In Proceedings of the Human Language Technology Conference of the NAACL, Main Conference (pp. 296–303).
4. Huang Q., Yuan Z., Xing Z., Zuo Z., Wang C., & Xia X. (2022). 1+1>2: Programming Know-What and Know-How Knowledge Fusion, Semantic Enrichment and Coherent Application. IEEE Transactions on Services Computing.
5. Li, H., Li, S., Sun, J., Xing, Z., Peng, X., Liu, M., et al. (2018, September). Improving api caveats accessibility by mining api caveats knowledge graph. In 2018 IEEE International Conference on Software Maintenance and Evolution (ICSME) (pp. 183–193). IEEE.
6. Zheng, S., Wang, F., Bao, H., Hao, Y., Zhou, P., & Xu, B. (2017). Joint extraction of entities and relations based on a novel tagging scheme. arXiv preprint arXiv:1706.05075.
7. Wei, Z., Su, J., Wang, Y., Tian, Y., & Chang, Y. (2019). A novel cascade binary tagging framework for relational triple extraction. arXiv preprint arXiv:1909.03227.
8. Wang, Y., Yu, B., Zhang, Y., Liu, T., Zhu, H., & Sun, L. (2020). TPLinker: Single-stage joint extraction of entities and relations through token pair linking. arXiv preprint arXiv:2010.13415.
9. Yang Y., Zhu Y., Chen S., & Jian P. (2023). API comparison knowledge extraction via prompt-tuned language model. Journal of Computer Languages, 75, 101200.
10. Lehmann F. (1992). Semantic networks. Computers & Mathematics with Applications, 23(2–5), 1–50.
11. Berners-Lee, T. J. (1989). Information management: A proposal (No. CERN-DD-89-001-OC).
12. Berners-Lee, T. (1998). Semantic web road map.
13. Berners-Lee T. (2006). Linked data. Int. J. on Semantic Web and Information Systems, 4(2).
14. Wang Q., Ji Y., Hao Y., & Cao J. (2020). GRL: Knowledge graph completion with GAN-based reinforcement learning. Knowledge-Based Systems, 209, 106421.
15. Hogan A., Blomqvist E., Cochez M., d’Amato C., Melo G. D., Gutierrez C., et al. (2021). Knowledge graphs. ACM Computing Surveys (CSUR), 54(4), 1–37.
16. Chen Z., Wang Y., Zhao B., Cheng J., Zhao X., & Duan Z. (2020). Knowledge graph completion: A review. IEEE Access, 8, 192435–192456.
17. Tang, M., Su, C., Chen, H., Qu, J., & Ding, J. (2020, December). SALKG: a semantic annotation system for building a high-quality legal knowledge graph. In 2020 IEEE International Conference on Big Data (Big Data) (pp. 2153–2159). IEEE.
18. Chen X., Jia S., & Xiang Y. (2020). A review: Knowledge reasoning over knowledge graph. Expert Systems with Applications, 141, 112948.
19. Chen P., Lu Y., Zheng V. W., Chen X., & Yang B. (2018). KnowEdu: A system to construct knowledge graph for education. IEEE Access, 6, 31553–31563.
20. Shi D., Zhou J., Wang D., & Wu X. (2022). Research Status, Hotspots, and Evolutionary Trends of Intelligent Education from the Perspective of Knowledge Graph. Sustainability, 14(17), 10934.
21. Chen, P., Lu, Y., Zheng, V. W., Chen, X., & Li, X. (2018, June). An automatic knowledge graph construction system for K-12 education. In Proceedings of the fifth annual ACM conference on learning at scale (pp. 1–4).
22. http://www.ltp-cloud.com
23. Sharma, M., Sharma, V. D., & Bundele, M. M. (2018, November). Performance analysis of RDBMS and no SQL databases: PostgreSQL, MongoDB and Neo4j. In 2018 3rd International Conference and Workshops on Recent Advances and Innovations in Engineering (ICRAIE) (pp. 1–5). IEEE.
24. Chatterjee A., Nardi C., Oberije C., & Lambin P. (2021). Knowledge graphs for COVID-19: An exploratory review of the current landscape. Journal of personalized medicine, 11(4), 300. pmid:33919882
25. Yu S., Su J., & Luo D. (2019). Improving bert-based text classification with auxiliary sentence and domain knowledge. IEEE Access, 7, 176600–176612.
26. Vaswani A., Shazeer N., Parmar N., Uszkoreit J., Jones L., Gomez A. N., et al. (2017). Attention is all you need. Advances in neural information processing systems, 30.
27. Raffel C., Shazeer N., Roberts A., Lee K., Narang S., Matena M., et al. (2020). Exploring the limits of transfer learning with a unified text-to-text transformer. The Journal of Machine Learning Research, 21(1), 5485–5551.
28. Zhao X., Jia Y., Li A., Jiang R., & Song Y. (2020). Multi-source knowledge fusion: a survey. World Wide Web, 23, 2567–2592.
29. Eshel, Y., Cohen, N., Radinsky, K., Markovitch, S., Yamada, I., & Levy, O. (2017). Named entity disambiguation for noisy text. arXiv preprint arXiv:1706.09147.
30. Yang Y., Zhu Y., & Jian P. (2022). Application of Knowledge Graph in Water Conservancy Education Resource Organization under the Background of Big Data. Electronics, 11(23), 3913.
31. Chen, H., Cao, G., Chen, J., & Ding, J. (2019). A practical framework for evaluating the quality of knowledge graph. In Knowledge Graph and Semantic Computing: Knowledge Computing and Language Understanding: 4th China Conference, CCKS 2019, Hangzhou, China, August 24–27, 2019, Revised Selected Papers 4 (pp. 111–122). Springer Singapore.
32. Nguyen, H., Chen, H., Chen, J., Kargozari, K., & Ding, J. (2023). Construction and evaluation of a domain-specific knowledge graph for knowledge discovery. Information Discovery and Delivery.
33. Singh R., & Mangat N. S. (2013). Elements of survey sampling (Vol. 15). Springer Science & Business Media.
34. Landis J. R., & Koch G. G. (1977). An application of hierarchical kappa-type statistics in the assessment of majority agreement among multiple observers. Biometrics, 363–374. pmid:884196

