TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: What's the point of automated skill assessment tests in the age of AI?

11 点作者 neverminder超过 1 年前
Back in the day there were two choices: refuse do do it or jump through the hoops. Today nobody I know bothers with automated skill assessment tests any more. Folks just feed it into ChatGPT and be done.<p>Yesterday I&#x27;ve applied to a role I found interesting, received a confirmation email and seconds later - skill assessment test link. So it looks like nowadays humans no longer bother reading the CVs, it&#x27;s all left to the bots. The test involved a tech stack I was not familiar with, so I thought it would be a good opportunity for the ChatGPT and it handled it well without even breaking a sweat. The though occurred to me that today one side uses AI to create and evaluate these tests and the other side uses AI to complete them. Is this the new reality? What&#x27;s even the point?

9 条评论

huevosabio超过 1 年前
Here is the solution to this:<p>1. Use AI to do the full on-site interview loop. At least for SWEs it is heavily structured, so something a good wrapper around GPT-4 can handle. It can even have an avatar if need be.<p>2. The AI-led &quot;on-site&quot; is done at your own time. But you must have a camera enabled.<p>3. HR just goes over the footage and other features to see if you have been using aids. In any case, the way you respond will be highly indicative of whether you used any aids.<p>4. If you pass and there is no cheating detected, then go straight to meeting the hiring manager. This interview is mostly behavioral and guessing whether you are a good match.<p>The problem today is that interviewing is expensive for both parties. Companies in particular get drowned with applications so they put all types of hurdles and auto-rejections.<p>But with AI, you could flip the process around. You could give _everyone_ the chance to do the full technical interview! Then you leave the human matching components for the best performing candidates.
viraptor超过 1 年前
Ideally, companies would evaluate what answers ChatGPT is capable of and&#x2F;or give tests which contain enough space for creativity that they&#x27;d be hard to pass this way. (Ask people to show off their knowledge of some domain without explicit instructions) If you get old style tests, they just haven&#x27;t caught up to what&#x27;s happening yet. Hopefully they&#x27;ll realise after a few second stage interviews and change it.
评论 #38006045 未加载
monkaiju超过 1 年前
Hopefully eventually we fallback to the more analogue solutions that have always been better anyway. Things like actually talking with applicants, seeing things they&#x27;ve actually built that have existed for some amount of time, talking to people in their networks, etc.
yodsanklai超过 1 年前
Companies will interview you on a white board, like many already do. There&#x27;s nothing new here. This is why we&#x27;ve been taking tests in class and not at home where we could use friends or family to do them for us.
kragen超过 1 年前
an interesting question is how to filter out the folks who just feed it through chatgpt, because obviously the people who just lie to get out of any mild annoyance are the ones you least want as coworkers (though obviously people who can effectively use large language models will be incredibly valuable)<p>i asked gpt-4 some programming questions today, and although it gave clear and convincing explanations in english, it made a lot of really dumb mistakes in writing the code. but this was in a mode where it couldn&#x27;t call out to an external interpreter; maybe a lot of those will go away once it can test the code<p>most programmers would have done worse, but is there a pattern to the dumb mistakes chatgpt makes that is clearly different from the dumb mistakes junior programmers make?<p>like for example i gave it this<p><pre><code> #!&#x2F;usr&#x2F;bin&#x2F;perl # See also cifs.py. Edited GPT-4 output; see &lt;https:&#x2F;&#x2F;bin.gy&#x2F;clignitter&gt; # Try watch -n .1 .&#x2F;wing.pl. for(1..9999){$t=0|rand 3;($y,$x)=$t?($y&#x2F;2,$x&#x2F;2+($t-1?40:10)):(.3*$x-.4*$y+12, .4*$x+(.2+abs($$%99&#x2F;99-.5))*$y);$a[$x][$y]=1}map{$r=$_*2;print map({substr &quot; .&#x27;:&quot;,2*$_-&gt;[$r]+$_-&gt;[$r+1],1}@a[0..79]),&quot;\n&quot;}(0..18); </code></pre> and asked it to translate it to lua, getting the following nonsense in response<p><pre><code> math.randomseed(os.time()) local a = {} for i=1,9999 do local t = math.floor(math.random() * 3) local x,y if t == 0 then x,y = .3*a[1][2]-.4*a[2][1]+12, .4*a[1][2]+(.2+math.abs(os.time()%99&#x2F;99-.5))*a[2][1] else x,y = a[1][2]&#x2F;2, a[2][1]&#x2F;2+(t-1 and 40 or 10) end if not a[x] then a[x] = {} end a[x][y] = true end local str = &quot; .&#x27;:&quot; for r=0,18 do for c=0,79 do local val = (a[c*2] and (a[c*2][r*2] or 0) or 0) + (a[c*2+1] and (a[c*2+1][r*2+1] or 0) or 0) io.write(string.sub(str,val,val)) end print() end </code></pre> of course, i&#x27;ve seen worse from interviewees, and understanding the original program is a high (and arguably counterproductive) bar, but perhaps the particular pattern of bonehead errors here could identify this as llm output? maybe the very fact of being so spectacular at understanding the original code is a red flag, at least combined with being so incompetent at writing the lua version?
评论 #38006055 未加载
评论 #38008626 未加载
评论 #38005835 未加载
from-nibly超过 1 年前
My opinion: if it was this easy to make interviews completely pointless then they were completely pointless before AI.<p>Why would a random skill assessment be able to tell if you are going to be a good employee? Your value isn&#x27;t tied to you knowing a bunch of random facts. We have the vastness of the Internet for that.<p>All AI has done is point out how useless our current interview process is.
shinycode超过 1 年前
Some companies now ask to do a coding test with screen sharing. It&#x27;s not a hard test but it&#x27;s interesting because you can see the reasoning and ask for more details if necessary. Live sessions means the company is interested in humans but it requires potentially lots of wasted ressources to find the right person.
HPsquared超过 1 年前
I guess it tests the ability to successfully apply ChatGPT. Which is a fairly relevant skill now, I guess?
评论 #38005510 未加载
jsyang00超过 1 年前
What&#x27;s even the point of a lot of these jobs in &quot;the Age of AI&quot;?<p>I don&#x27;t know. You don&#x27;t know. Let&#x27;s hope it all keeps going a little longer so some of us can keep a roof over our heads.