From Answer Keys to Socratic Mentors
SC1007 — Data Structures and Algorithms is a foundational, compulsory course for every undergraduate at the College of Computing and Data Science (CCDS), Nanyang Technological University — a cohort of around 1,000 students each year. It was redesigned under the College's Learn with AI framework for an AI-shaped future, where understanding why code works matters more than simply generating it.
AlgoGPT supports learning through guided hints rather than direct answers. Using Socratic questioning and a Driver–Navigator learning model grounded in Cognitive Load Theory and Self-Regulated Learning, students alternate between implementing solutions independently, explaining their reasoning, and critically evaluating AI-generated code to strengthen algorithmic thinking, reflection, and problem-solving skills.
NTU's College of Computing & Data Science is embedding AI into how computing is taught, practised and assessed through its Learn with AI framework. In SC1007, students explore core data structures and algorithms in an AI-supported environment that offers guided hints and feedback — never finished solutions — as they solve problems. Practice is dynamically generated to each student's skill level, so a cohort of over a thousand learns at its own pace while building the judgement to evaluate AI-generated code.
AI systems can generate code remarkably quickly, but understanding why that code works remains fundamental. In our courses, students learn to use these tools responsibly while still making sound design decisions, reasoning through problems, and verifying the correctness of their solutions.
Every problem moves a student through three modes of thinking — the structured approach at the heart of SC1007's redesign. AlgoGPT supports each stage with Socratic questions, never finished code.
Students put their algorithmic logic into words before any code is written, externalising the mental model so gaps surface early.
Students write and refine the solution themselves, receiving tiered Socratic nudges only on request — direction, never the answer.
Students review AI-assisted output critically, verifying it against the fundamentals — the skill that matters most in an AI-shaped world.
Each two-hour lab is built around the Zone of Proximal Development — students first build reasoning unaided, then extend it with adaptive AI support. The same AI-ON / AI-OFF principle carries into assessment: independent mastery is always verified separately from effective tool use.
Students design and analyse linked lists, stacks, queues, trees and graphs by hand — developing problem decomposition and debugging with no GenAI in the room.
Students pair with AlgoGPT in Driver and Navigator modes while Teaching Assistants act as meta-coaches — guiding responsible, reflective use of every piece of AI feedback.
"Artificial intelligence is a multiplier. But if the multiplicand is zero, the outcome is zero."
Standard large language models act as a black box: they hand over working solutions instantly, letting students bypass the very struggle that builds deep understanding. Learning theory is clear that durable knowledge requires effortful, active construction — and frictionless assistance short-circuits it.
Immediate, complete solutions. Speed without comprehension — and a learner who never owns the logic.
Structured friction by design. Hints, questions and deliberately flawed code that force reasoning — turning answers into earned insight.
AlgoGPT closes the loop between teaching and practice. Professors generate syllabus-aligned questions; students solve them in the same environment where labs are scheduled, submissions graded, and progress profiled — all in one place.
An adaptive Week-2 quiz establishes each learner's baseline mastery across core topics, then generates a personalised profile with targeted recommendations.
Theory-based multiple-choice components reinforce understanding alongside coding practice, ensuring conceptual clarity supports implementation.
Targeted sets are generated from each student's struggle patterns, drawn from a repository of several hundred problems — including 150 widely-used DSA interview questions.
The Driver–Navigator pair-programming environment turns every lab session into structured, reflective, friction-first practice.
Submission logs give a real-time pulse of the cohort, letting instructors detect unproductive frustration and intervene early.
A closed lab system with access codes, enrolment, and session tracking — built around the professor → TA → student hierarchy.
Instead of a single chatbot, AlgoGPT runs two specialised agents. The Driver withholds solutions and coaches through tiered Socratic feedback; the Navigator reverses the roles, writing deliberately flawed code for the student to review.
Students write code; the AI intervenes only on a student-initiated request. Rather than offering fixes, the Driver decomposes feedback into four escalating tiers — functioning like a senior instructor who scaffolds through questions, never supplying runnable code.
A conversational AI interface is available across every lab. Students may ask about concepts or requirements, but the AI still withholds direct code solutions. It serves as a within-system baseline — controlling for AI availability while isolating the incremental effect of friction.
AlgoGPT is built on a stateless backend with LangGraph agent orchestration and Azure-hosted GPT-4o models providing pedagogically constrained reasoning. Each agent has one job; together they manage cognitive load end to end.
Runs the diagnostic quiz, classifies mastery, and generates each student's personalised learner profile.
Inspects submissions to detect misconceptions and the specific reasoning gap behind a failed attempt.
Guards the Socratic protocol — ensuring feedback never leaks runnable solutions to the student.
Delivers real-time, non-directive feedback through reflective prompts and scaffolding strategies.
Sequences problems and difficulty, moving learners along a personalised topic progression.
Grounds every agent response in the exact problem statement and course knowledge base.
Across a 14-week semester, AlgoGPT's friction modes produced significantly higher-quality behavioural patterns than standard AI assistance.
Lower "frictionless" passing in friction modes reflects deeper cognitive effort — not failure.
26% ran to time expiry — an "immersion effect", not abandonment.
Driver mode raises iterative engagement; Navigator mode shows the "efficiency of insight" — fewer attempts, deeper pre-submission analysis.
| Metric | Normal · Baseline | Driver · Socratic | Navigator · Evaluator |
|---|---|---|---|
| Eventual success rate | 88.0% | 58.6% | 55.0% |
| Mean attempts per session | 3.07 | 3.24 | 2.36 |
| Attempts to first accept | 2.46 | 2.66 | 1.80 |
| Mean tests passed / submission | 3.51 / 10 | 5.13 / 10 | 3.57 / 10 |
Across the 14-week semester, 35,433 submissions were processed without meaningful attrition. Lab-level success climbed steadily as students adapted to friction-first practice — engagement increased rather than eroded.
By withholding runnable code, AlgoGPT removes the shortcut that otherwise consumes working memory without building skill — redirecting effort toward germane load.
Students plan, monitor, reflect, and check their own understanding — the survey's two strongest measured dimensions.
Navigator Mode's Accept / Reject gate makes students commit to a judgement before trusting AI output — reducing uncritical over-reliance.
Overall mean 3.39 / 5.0; all 12 items significantly above the neutral midpoint. "Checking understanding" gave the study's largest effect (t = 6.97).
AlgoGPT profiles every learner and detects struggle at a fine grain, so 24/7 Socratic coaching reaches a diverse cohort once shaped by access to private tutoring.
Week-2 profiling quiz — N = 760 students classified by the Learner Profiling Agent.
A mostly-beginner cohort — the profile that benefits most from progressive, confidence-building scaffolding.
A 9-level ordinal scale moves beyond code correctness to capture cognitive difficulty.
A lightweight, web-based platform — no high-end hardware, no installs. World-class scaffolding on any device.
Immediate, personalised feedback for learners who cannot afford private academic support — available at every hour.
Scaffolded support for struggling learners; critical evaluation for advanced ones — diverse entry points, one system.
A stateless architecture scales from ten students to ten thousand on standard infrastructure. The pedagogy is domain-agnostic, and the roadmap turns today's uniform friction into tomorrow's adaptive scaffolding.
Stateless sessions with no server-side memory overhead scale effortlessly on Azure container infrastructure and a React frontend.
The tiered-nudge protocol is tied to structured reasoning, not a language — ready for Systems Programming, Maths, Physics and beyond.
Automated, AI-blind test cases keep grading objective; submission logs let instructors detect unproductive frustration early.
Calibrating the intensity of "friction" to a student's real-time struggle, so productive struggle never tips into unproductive confusion.
Mermaid.js integration to render data-structure and algorithm visuals inside the feedback loop, anchoring abstract concepts.
Whisper-based models to let students give and receive feedback by voice — humanising the pair-programming experience.
Pre/post assessments and difficulty-controlled comparisons to move from evidence of engagement to measured learning gains.
The peer-reviewed deployment analysed 907 students and 14,958 submissions across 27 DSA problems; the figures throughout this site reflect the expanded post-publication dataset of 35,433 submissions.
Try the live platform that turned ~1,000 students from code-writers into logic-architects.