
Bayesian Student Modeling in ITS
Document information
Author | Nhung Hausler |
instructor | Tanja Mitrovic |
School | University (Name not specified in document) |
Major | Computer Science |
Document type | Thesis |
Language | English |
Format | |
Size | 1.38 MB |
Summary
I.Normative Intelligent Tutoring Systems ITS A Bayesian Approach to Student Modeling
This thesis introduces a novel normative Intelligent Tutoring System (ITS) that leverages Bayesian networks and decision theory for optimal student modeling and pedagogical action selection (PAS). Unlike traditional ITSs with ad-hoc implementations of learning theories, this Bayesian student modeling approach guarantees optimal application of learning theories by representing student beliefs using probability distributions and utility functions. The system's adaptive decision-making capabilities allow for personalized learning experiences by dynamically adjusting teaching actions based on student performance. A core methodology involves using machine learning to construct and update the Bayesian network student model, incorporating real-world classroom evaluation data for continuous improvement.
1. Defining Normative Intelligent Tutoring Systems
The core of this section establishes the concept of a normative Intelligent Tutoring System (ITS). It contrasts this approach with traditional ITS designs. The thesis emphasizes that normative ITSs utilize Bayesian probability theory and decision theory to model student knowledge and select optimal teaching actions. This contrasts with traditional ITSs, which often employ ad-hoc methods lacking the optimality guarantee offered by a normative framework. A key advantage of the proposed Bayesian approach is its ability to optimally apply any precisely defined learning theory to the student, provided the theory is represented using normative methods (probability distributions and utility functions). The text highlights that traditional systems are not guaranteed to be optimal because their implementation of learning theories is often ad hoc. The importance of representing student beliefs within a Bayesian network and using decision-theoretic principles for action selection is stressed as fundamental to the efficacy of this new type of ITS. The inherent rationality of this system is emphasized, differentiating it from traditional systems which may exhibit irrational behaviour due to issues not found in a normative approach.
2. Components of a Normative ITS and Limitations of Traditional Approaches
This part details the crucial components of a normative ITS beyond the core Bayesian network and decision theory. It acknowledges the current limitations of natural language processing in ITS development, leading to the use of alternative interfaces. The section emphasizes the shift in goals from completely replacing teachers to supporting them, recognizing that computer-based instruction is not universally suitable for all students or topics. The role of the user interface in maintaining student motivation and reducing cognitive load is discussed. Effective interface design is crucial for ensuring system effectiveness and enhancing learning outcomes. This section makes a critical distinction between the capabilities and limitations of current ITS technology. A key takeaway is that even with significant advances in ITS development, the system's role is best defined as a supportive tool for teachers. This focus on supporting teachers, rather than replacing them, allows for more personalized instruction and optimal allocation of teacher time. This approach also acknowledges the realities of diverse learning needs and preferences among students.
3. Bayesian Networks Inference and Decision Theory in the Normative ITS
This section delves into the technical details of the proposed ITS architecture. It explains Bayesian networks, specifically addressing inference in multiply-connected networks, using Bayes' Theorem for efficient posterior probability calculations. The concept of 'beliefs' as an alternative to probabilities within the network is also explained. The document highlights the challenges in efficiently computing posterior probabilities, particularly in complex Bayesian networks. It introduces the concept of beliefs, which are computationally more efficient because they don't require normalization after each calculation, although their interpretation differs slightly from probabilities. A specific algorithm is described involving three phases—drafting, thickening, and thinning—used to process and simplify these complex networks. The foundation for decision-making within the ITS is established through an introduction to decision theory, explaining how it integrates uncertain beliefs with preferences to optimize behavior. This section offers a detailed explanation of the mathematical and computational underpinnings of the proposed system, emphasizing the careful consideration given to the complexity of calculations within the system and the need for efficient algorithms. The use of Bayesian networks and decision theory provides a robust framework for making informed pedagogical decisions.
4. Student Modeling within Psychological Frameworks and Existing ITS Approaches
This section places student modeling within various psychological frameworks, comparing and contrasting behaviorism, cognitivism, and constructivism. It examines how traditional ITSs with student models align with these frameworks. The section explores the limitations of traditional ITSs, particularly their incompatibility with constructivist principles. The use of tests to assess mastery is categorized as behaviorist, while the internal state modeling is identified as cognitivist. Constructivism is seen as contradictory to traditional ITS approaches due to the individualized nature of knowledge construction in this framework. This discrepancy emphasizes that traditional ITSs, which often decompose knowledge into parts, are inherently at odds with constructivist learning principles. Existing approaches to student modeling are categorized and evaluated, highlighting their advantages and disadvantages. This section provides important context, showcasing how the proposed normative ITS is grounded within educational psychology and improves upon limitations of previous approaches. Various existing student modeling approaches are reviewed, including fixed stereotyping (Milne et al., 1996), feature-based modeling (FBM, Webb et al., 1997), and reconstructive methods like INSTRUCT (Mitrovic et al., 1996) and SPENGELS (Bos & van de Plassche, 1994), to showcase diversity in approach and the benefits of a Bayesian approach.
5. Addressing Tractability Uncertainty and the Role of Cognitive Modeling
This section directly addresses the key challenges in student modeling: balancing representation quality, inference tractability, and inherent uncertainty. Model-tracing and perturbation methods are highlighted as reaching the limits of tractability in complex domains. The section emphasizes the importance of using student performance data to drive student model design rather than relying solely on cognitive fidelity. The text further discusses the limitations of models representing unobserved internal states, arguing that such models cannot adapt online to individual student needs. The alternative approach of using observable variables for online adaptation is proposed. The challenges of high-fidelity cognitive modeling are discussed, contrasting it with an approach focused on predicting student behavior using data. The section highlights the limitations of existing systems like ANDES and HYDRIVE, which cannot adapt their models online due to reliance on unobserved internal student states. The discussion concludes by emphasizing the need for a data-driven approach in student model design.
II.Methodology A Five Step Approach to Building a Normative ITS
The research proposes a five-step methodology for developing a normative ITS. It begins with randomized data collection in a classroom setting using a partially complete ITS, followed by machine induction of a Bayesian network student model from this data. The third step involves constructing a decision-theoretic PAS strategy, linking the student model to decision problems the ITS must solve. The fourth step uses online Bayesian network induction to adapt the model to individual students over time. Finally, a full-scale classroom evaluation compares the complete system to the initial randomized version to assess the effectiveness of the normative techniques.
1. Randomized Data Collection Establishing a Baseline
The initial phase focuses on collecting data from a real-world classroom setting. A nearly complete version of the intelligent tutoring system (ITS) is deployed, but with a crucial difference: the pedagogical action selection (PAS) strategy is randomized. This means that the system's actions, such as problem selection, are entirely random, ensuring that the collected data reflects a broad range of possible interactions. All actions are logged in a structured format: <State, Action, Outcome>. 'State' describes the system's state before action selection (e.g., the student model or recent student history). 'Action' represents the randomly chosen pedagogical action (e.g., the next problem). 'Outcome' records the observed results (e.g., correct or incorrect). The randomness of the PAS ensures uniform data distribution across all possible actions, providing an unbiased starting point for subsequent stages of the methodology. This initial phase serves as a crucial baseline for comparison during later stages of development and evaluation. The data collected here forms the foundation upon which the adaptive student model will be built and refined. The emphasis on real-world classroom data collection highlights the commitment to practical applicability and informed development.
2. Machine Induction of a Bayesian Network Student Model
This stage utilizes the data gathered in the first step to create a Bayesian network student model. Machine learning techniques are employed to induce the model structure and parameters from the observed student performance data. The rationale for this approach is that initializing the model with real student data will result in a more effective starting point compared to relying on default values or expert estimations. The generated Bayesian network will represent the probability distributions over different student states and outcomes, allowing for prediction of student performance based on previous interactions. The use of machine learning enables the model to adapt to the specific characteristics of the student population. It moves beyond pre-defined assumptions to create a model directly informed by real student behaviour, resulting in a more accurate and robust representation of student knowledge and capabilities. The use of machine learning in this phase is a departure from traditional ITS methods, which often rely on manually constructed student models, highlighting the modern and innovative aspect of this methodological approach.
3. Decision Theoretic Strategy Construction Linking the Model to Actions
With a Bayesian network student model established, the focus shifts to designing a decision-theoretic strategy for pedagogical action selection (PAS). This step involves clearly defining the decision problems the ITS will need to solve (e.g., choosing the next problem, selecting appropriate feedback). These decision problems are then directly linked to the student model, utilizing its predictions about student performance to inform action selection. The student model's value lies in its capacity to generate predictions that will inform the choice of optimal pedagogical actions. It provides a mechanism to tailor the system's behaviour to the individual needs and progress of each student. Decision theory provides the mathematical framework for combining these predictions with predefined utility functions to make rational choices. This is a crucial step in transforming the student model from a passive representation into an active component that guides the adaptive teaching process, ensuring that the system's actions are both effective and efficient. This phase bridges the gap between theoretical modelling and practical application, showing how the model directly informs decision-making within the system.
4. Online Bayesian Network Refinement Adaptive Learning from New Data
This step is another machine learning phase, focusing on the online adaptation of the Bayesian network model. As new data is collected from a student interacting with the system, this fresh information is used to update and refine the existing model. The algorithm gradually biases the network towards the current student, personalizing the model over time. This online adaptation ensures that the model is constantly evolving to accurately reflect the student's changing knowledge and learning progress. While pre-existing algorithms for Bayesian network induction exist, this methodology addresses the unique challenge of handling dynamic student data, unlike traditional static data sources. This continuous learning capability is a core feature of the adaptive decision-making process, enabling the system to continuously improve its pedagogical strategies based on real-time student feedback. The online refinement process enhances the system’s responsiveness and personalization, making it more effective at supporting each student’s individual learning journey. Modifying existing algorithms to handle the dynamic nature of student data is a key innovation.
5. Full Scale Classroom Evaluation Assessing System Effectiveness
The final step involves a comprehensive classroom evaluation of the fully developed ITS. The goal is to assess whether the added computational effort of the normative calculations results in a superior system compared to the initial randomized version. This real-world testing is essential for validating the methodology and determining the true effectiveness of the system. The evaluation compares the performance of the complete system against the initial randomized baseline, directly assessing the impact of the Bayesian network student model and the decision-theoretic PAS strategy. This step provides crucial feedback for further system development and refinement. Classroom evaluation is critical, as it provides data that reflects the system's performance in a realistic learning environment, accounting for the nuances and complexities of actual classroom dynamics. This contrasts with traditional lab-based evaluations, which may not adequately capture the system’s effectiveness in a real-world setting. The focus on measurable improvement in student outcomes underscores the practical aim of this methodology.
III.Classroom Evaluation of SQL Tutor and CAPIT Results and Lessons Learned
An evaluation study was conducted using SQL-Tutor, an ITS for teaching SQL, to explore the feasibility of normative techniques in problem selection. A second evaluation involved CAPIT, an ITS for teaching punctuation to 9-10 year olds at Ilam School, Christchurch, New Zealand. In the SQL-Tutor study (October 1999), 18 students formed the control group, and 14 the experimental group. The CAPIT study involved three classes (Group A: control, Groups B and C: experimental). Results from both studies, although limited by factors like sample size and study duration, suggest that the proposed approach leads to improvements in student learning. The CAPIT study, with its three classes (approximately 27-30 students total), highlighted the effectiveness of the adaptive Bayesian student model in improving learning speed compared to a system with randomized problem selection. Challenges related to the complexity of modeling a large number of constraints in SQL-Tutor and the need for more sophisticated natural language processing in CAPIT were also identified.
1. SQL Tutor Evaluation Investigating Normative Problem Selection
This section details a classroom evaluation of SQL-Tutor, an ITS for teaching the SQL database language. The study, conducted in October 1999 with second-year university students, compared two versions of the system: one incorporating a probabilistic student model and problem selector (experimental group), and one without (control group). The experimental design involved a two-hour session including pre- and post-tests, with students randomly assigned to either group. A total of 18 students were in the control group and 14 in the experimental group. The study's timing constraints—the need for prior database knowledge and the limitations of the academic calendar—resulted in a smaller than ideal number of participants and a low post-test completion rate. While pre-test scores were comparable across groups, the low number of completed post-tests (one from the control and four from the experimental group) limited the ability to draw strong conclusions. However, despite the methodological limitations and the incorporation of some heuristic elements alongside normative techniques, the results still suggested some improvement with the probabilistic problem selection. This evaluation serves as an initial exploration of the proposed methods, informing future iterations and research directions.
2. CAPIT Evaluation A Four Week Classroom Study with Elementary School Students
The evaluation of CAPIT, an ITS designed to teach punctuation to 9-10 year olds, involved three classes (approximately 27-30 students total) at Ilam School in Christchurch, New Zealand, over four weeks. One class served as a control group (Group A), receiving no tutoring. The other two classes (Groups B and C) used the CAPIT system. Group B utilized a randomized version of the tutor, while Group C used the full version with decision-theoretic PAS and an adaptive Bayesian student model. Each class had one 45-minute session weekly, with students working in consistent pairs. All interactions were logged, and identical pre- and post-tests were administered. The study design directly compared the effects of randomized problem selection (Group B) versus the adaptive Bayesian approach (Group C) against a control group that received no intervention. Analysis of the data showed Group C (using the adaptive model) initially made more errors than Group B, but rapidly decreased their error rate, suggesting faster learning. This was particularly evident when analyzing specific constraints related to sentence separation. The results supported the hypothesis that the decision-theoretic approach in Group C led to more effective learning.
3. Lessons Learned Challenges and Future Directions
This section discusses the insights gained from the evaluations of SQL-Tutor and CAPIT. While SQL-Tutor's evaluation showed positive but limited results due to a large number of constraints making a purely Bayesian and decision-theoretic approach intractable (over 500 constraints), it highlighted the feasibility of incorporating normative techniques alongside heuristics. The limitations of SQL-Tutor (high constraint count) and similar systems (ANDES) which necessitate simplifying assumptions, led to the recognition that modeling all constraints in a single Bayesian network is not always computationally feasible. The evaluation provided guidelines for future research, including focusing on domains with a smaller, manageable number of constraints and leveraging improvements in hardware and algorithms to address tractability. This also suggests prioritizing a more targeted approach to constraint modeling. The CAPIT evaluation, involving the three classes of 9-10-year-olds, showed significantly faster learning in the group using the adaptive system, but also revealed limitations such as a need for better natural language processing capabilities. These findings suggest that future research should focus on improving system efficiency, considering qualitative algorithms, and developing better techniques for handling natural language and semantics, particularly in more complex domains.
IV.Comparison of Student Modeling Techniques and Future Research Directions
The thesis compares various student modeling techniques, including feature-based modeling (FBM), model tracing, and constraint-based modeling (CBM). It highlights the trade-offs between model complexity, inference tractability, and the inherent uncertainty in student behavior. Future research directions include: a comparative study of different student modeling techniques on teaching effectiveness; exploring more efficient qualitative algorithms for large domains; and addressing the challenges of natural language processing and semantic understanding in literacy domains. The relatively good performance of simpler models in predicting student performance suggests careful consideration of model complexity when designing ITS architectures.
1. A Review of Existing Student Modeling Techniques
This section provides a comparative overview of various student modeling techniques used in Intelligent Tutoring Systems (ITS). It discusses the strengths and weaknesses of different approaches, highlighting the trade-offs between the complexity of the model and the tractability of inferences. The document mentions several approaches, including fixed stereotyping (Milne et al., 1996), which is considered too limiting; feature-based modeling (FBM, Webb et al., 1997), which uses machine learning to associate problem states with actions; and model tracing, which attempts to track a student's problem-solving path. The challenges of accurately modeling student problem-solving strategies are discussed, particularly with respect to the complexity of dealing with composite bugs and the variability of student errors across different contexts. The section mentions constraint-based modeling (CBM) as a more tractable compromise and emphasizes the lack of comparative studies on the effects of different student modeling techniques on teaching effectiveness. This gap highlights a significant area needing future research, illustrating the need for empirical evidence to guide the selection of appropriate student modeling techniques for specific domains and learning objectives.
2. Contrasting Approaches Expert Centric vs. Data Driven Student Models
The discussion contrasts expert-centric and cognitive modeling approaches with data-driven approaches to student modeling. Expert-centric methods, like model tracing, strive for isomorphism with the expert's cognitive representation but often suffer from tractability issues. The thesis advocates a data-driven approach, prioritizing the prediction of student behavior over strict cognitive fidelity. Systems like HYDRIVE, which focus on capturing elements discriminating between expert and non-expert behavior, are presented as examples of this philosophy. A key disadvantage of models that represent unobserved internal states is their inability to adapt online to individual students. The example of a simple Bayesian model with observed and unobserved variables illustrates this limitation: the conditional probability relating them cannot be updated online. In contrast, models with observable variables (e.g., 'Before' and 'After' states) allow for online adaptation and continuous learning from student interactions. The CAPIT system, detailed in a later chapter, exemplifies this data-driven approach to online adaptation.
3. Future Research Directions in Student Modeling and ITS Design
This section outlines key areas for future research in student modeling and ITS design. It emphasizes the need for comparative studies to evaluate the impact of different student modeling techniques on the overall effectiveness of ITS. Efficiency considerations are a major focus, acknowledging the potential intractability of Bayesian networks and decision theory in large domains. The text suggests exploring qualitative algorithms as a more efficient alternative to numerical methods, mentioning qualitative Bayesian networks and decision-theoretic schemes as examples. Other future research directions include addressing the challenges of natural language processing and semantic understanding in literacy domains. The document also emphasizes the importance of justifying the choice of model complexity (e.g., using statistical significance tests) and exploring which domains are best suited for simpler models with no explicit internal representation of student states. The need to consider efficient methods to handle complex domains, along with a more thorough evaluation of existing models, underscores the ongoing evolution of this field.