I teach a course on Python programming at a UK university. The assessments are 'take-home' programming problems, a little like Advent of Code. Cheating is always a concern, but now it is trivially easy to generate code that gets a very high mark (I have checked). What do HN readers suggest as a strategy moving forward? In-person exams are possible but not ideal.
I have a radical take on this. Why not just let them do what they want? Put all the emphasis on the exams.<p>Education for some reason has always been about bullshit busy work anyway. Homework is for people who don’t understand the material and need more practice, imo, and should be optional and worth 0-5% of the grade.<p>Have midterms, finals, and projects count more; the tests would probably be monitored by proctors anyway, right?<p>I went to a mediocre college because in high school I would get Fs on homework but 95-100 on midterms and finals, 5s on all my AP tests, and a near-perfect SAT. Artificial gatekeeping based on a GPA built on BS work: even if I understood the material and aced the tests, the best I could get was something like a B.<p>And I did “bad” in college too, where I failed homework and scored 95-100 on exams. If you’re gonna impose busy work on me, I’m not doing it if I don’t feel it’s necessary for me, and I shouldn’t be punished for that. Neither should these students, if they are genuine about learning. I’m there to learn by choice.<p>Education needs to evolve. It keeps some people down for no reason.
Does cheaters getting a high mark have a direct negative impact on other students? (E.g. grading on a curve.) In that case, you should probably switch to a method that makes everyone's mark depend only on their own performance.<p>Other than that, students who don't understand the material well being able to outperform others who do may seem unfair, but in the end your course probably exists to teach students programming fundamentals that are expected as prerequisites in <i>other</i> courses, and the exercises serve to prepare students for the more difficult problems they'll encounter in those other courses.<p>If a student decides they don't need that practice and ends up stumped later when GPT can't help them anymore, that's their own bad decision to deal with. (On the other hand, if that never happens maybe the Python course wasn't all that necessary for them to begin with.)<p>So I recommend optimizing mostly for the students who do want to learn something; put enough effort into anti-cheating measures to keep honest people honest, but don't worry too much about cheaters beyond that.
At UIUC we've been moving towards testing student programming skills in our Computer-Based Testing Facility (CBTF). This is a set of computer labs that are firewalled from the internet. Students self-schedule into testing slots at the CBTF, then at their scheduled time they go to the CBTF and take a programming (or other online) test that is auto-graded on the spot. This is very secure and has little overhead for faculty. It also automatically handles disability accommodations like extended time tests and reduced distraction requirements, and if students are sick then they can automatically reschedule their own exams. We currently run about 70,000 exams per term like this and it's heavily used by our largest classes (over 1000 students in a class). More details here: <a href="http://zilles.cs.illinois.edu/papers/zilles_csedu_cbtf_2019.pdf" rel="nofollow">http://zilles.cs.illinois.edu/papers/zilles_csedu_cbtf_2019.pdf</a><p>For OP, we've also been using Zoom-based proctoring options for students/courses that can't use the CBTF for some reason. I'd be happy to follow up with you if you want more details about how we make this work at scale.
In my undergrad CS coursework, we were graded largely on projects, not exams.<p>If the solution isn't short (more than 200 lines), ChatGPT and Copilot can help but cannot solve the problem completely. The more complex the project, the harder it is to generate the whole thing with an AI, and even someone using an AI to help will still need to understand the concepts well enough to get good results.
Assuming there are more well-authenticated artifacts, you could use something like a causal graph to identify where students may not have been honest in their take-home work. If 100% comprehension of, say, nested for-loops suddenly drops to 30% comprehension when the student is being watched, that's cause for concern; you'd just need infrastructure in place to trace those relationships (rough sketch below).<p>Another option would be to ask for pseudocode, drawings, or flowcharts instead of Python. I don't have access to check whether Copilot can produce pseudocode, but I don't think the mechanical task of literally writing code is the best way to test this sort of thing anymore; or perhaps it's just a bad target/metric.<p>Switching the course to some obscure graphical programming language might be another option.
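A minimal sketch of what that cross-check could look like; the topic names and the 0.4 drop threshold are invented here, not from any real system:<p>    # Flag students whose per-topic comprehension drops sharply between
    # take-home and proctored work; a big drop is cause for a closer look.
    def flag_discrepancies(take_home, proctored, threshold=0.4):
        """Both arguments map topic -> score in [0, 1]."""
        return [
            (topic, round(take_home[topic] - proctored.get(topic, 0.0), 2))
            for topic in take_home
            if take_home[topic] - proctored.get(topic, 0.0) >= threshold
        ]

    # 100% on nested for-loops at home, 30% when watched -> flagged.
    print(flag_discrepancies(
        {"nested_for_loops": 1.0, "recursion": 0.8},
        {"nested_for_loops": 0.3, "recursion": 0.75},
    ))
    # [('nested_for_loops', 0.7)]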
I am the director of an online bootcamp, and we tackle the cheating problem with a custom browser-based platform that records three video feeds during an exam: 1. the screen capture, 2. the webcam, and 3. the video feed of the student's phone, placed so we can see the student's desk. It makes it pretty hard to cheat, but it does require lots of supervisor time getting all students set up correctly and then monitoring the video feeds.
Can you teach using git repositories?<p>Have them commit in a way that shows their understanding at each point as they incrementally improve or solve the problem.<p>Have them split into groups and submit MRs to each other.<p>Give them the ability to tag each other.<p>Have them upload their MR to a system with CI and check that it passes tests, lint, style, etc.<p>If the commits are broadly similar, they likely followed the same process as most of the class; if the commits are very different, check their understanding and/or for cheating (rough sketch of that check below).
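To make the commit-comparison idea concrete, a minimal sketch; the "student_repo" path is hypothetical, and counting commits and reading messages is just one simple heuristic:<p>    # Summarise a student's commit history so a single giant "done" commit
    # stands out against a classmate's string of incremental commits.
    import subprocess

    def commit_summary(repo_path):
        """Return (commit_count, messages) for a local git repo."""
        out = subprocess.run(
            ["git", "-C", repo_path, "log", "--pretty=format:%s"],
            capture_output=True, text=True, check=True,
        ).stdout
        messages = out.splitlines()  # newest commit first
        return len(messages), messages

    count, messages = commit_summary("student_repo")  # hypothetical path
    print(count, "commits; earliest:", messages[-1] if messages else "none")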
I think teaching proper engineering, i.e. building whole systems where the design/architecture is what gets graded, would make your grading largely immune to GPT.<p>Obviously, this only makes sense if your students are CS majors. If they are not, you can always teach a light version of this in the spirit of HtDP or <a href="https://dcic-world.org" rel="nofollow">https://dcic-world.org</a>.
When I was in college, students could do any math exam in Maple and Matlab, so the questions <i>required</i> Maple and Matlab. The rest were derivation questions, which neither tool could do.<p>Just raise the bar. Give them a lot of code and ask them to hack a feature into it. Chain together multiple difficult programming questions to the point that they have to use GPT to get to a solution.
Steer clear of the most classic programming problems; don't use problems built on widely known concepts (like prime numbers), but rather define your own concepts that are not as widely used. Look for/design questions that require the student to recognize edge or special cases and handle them. A combination of these usually seems to throw ChatGPT off.
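For example, something in this shape; the "step-stable" concept below is invented on the spot, purely to illustrate a custom concept with deliberate edge cases:<p>    # Invented concept: a sequence is "step-stable" if each adjacent gap
    # is no larger than the previous gap. Reference solution plus the
    # edge cases (empty, single element) a student should spot.
    def is_step_stable(xs):
        gaps = [abs(b - a) for a, b in zip(xs, xs[1:])]
        return all(later <= earlier for earlier, later in zip(gaps, gaps[1:]))

    assert is_step_stable([])             # edge case: empty sequence
    assert is_step_stable([7])            # edge case: single element
    assert is_step_stable([0, 5, 8, 9])   # gaps 5, 3, 1 shrink
    assert not is_step_stable([0, 1, 5])  # gap grows from 1 to 4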
Don't do exams; give them readings plus group projects and let them present them. Not sure how large the class is, but this is probably not realistic when you have 10+ groups.