TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

The drama in trying to convert election PDFs to Spreadsheets

716 点作者 markessien大约 2 年前

40 条评论

OoTheNigerian大约 2 年前
Nice read. It&#x27;s important to note<p>1.The 2020 protesters did not begin vandalizing property, but government infiltrated the protests by burning cars and maiming people.<p>2. The Obidient movement encompassed multiple sub movements of which a part of the #EndSARS was one of them. A vast majority of Peter Obi&#x27;s supporters were not #EndSARS activists.<p>3. Elections in Nigeria are fraught with treacherous behavior so everyone suspects everything. It&#x27;s important to be very careful with your communication. There is a lot of desperation in the land and so if in a position of information leverage, the responsible thing is to handle the privilege with care and transparency.
crazygringo大约 2 年前
First of all, what a fantastic and inspiring read.<p>But, I&#x27;m left greatly confused -- the article never states whether this changed the result.<p>It says that halfway through counting Obi was in the lead, but nothing about when finished counting.<p>And when I look at the spreadsheet, the last row (#3380) appears to be the totals, which lists:<p><pre><code> APC LP PDP NNPP 149014 85748 329030 8305 </code></pre> Which shows LP (Obi) in third place, just like the official results.<p>So what point is the article trying to make at the end of the day? Or have I misunderstood the numbers?
评论 #35275648 未加载
评论 #35280167 未加载
评论 #35280405 未加载
评论 #35283678 未加载
djoldman大约 2 年前
Checking one at random:<p><a href="https:&#x2F;&#x2F;docs.google.com&#x2F;spreadsheets&#x2F;d&#x2F;1HhV9iJxXTU9liAZPIDoMOZ93L48uZRnnT13G-lqI150&#x2F;edit#gid=786971613" rel="nofollow">https:&#x2F;&#x2F;docs.google.com&#x2F;spreadsheets&#x2F;d&#x2F;1HhV9iJxXTU9liAZPIDoM...</a><p>...shows 0s in the first row for all candidate parties. But the corresponding photo shows votes for all three:<p><a href="https:&#x2F;&#x2F;inec-cvr-cache.s3.eu-west-1.amazonaws.com&#x2F;cached&#x2F;results&#x2F;2676&#x2F;result_46011_1677492816_thumb.jpg" rel="nofollow">https:&#x2F;&#x2F;inec-cvr-cache.s3.eu-west-1.amazonaws.com&#x2F;cached&#x2F;res...</a><p>I hope it&#x27;s not a mistake and that there&#x27;s some arcane law&#x2F;technicality to explain it.<p>edit: another mistake on row 21, LP should get 25 but it was credited to NNPP:<p><a href="https:&#x2F;&#x2F;docs.inecelectionresults.net&#x2F;elections_prod&#x2F;1292&#x2F;state&#x2F;1&#x2F;lga&#x2F;3120&#x2F;ward&#x2F;17712&#x2F;pu&#x2F;2707&#x2F;2707-1677468711.pdf" rel="nofollow">https:&#x2F;&#x2F;docs.inecelectionresults.net&#x2F;elections_prod&#x2F;1292&#x2F;sta...</a>
评论 #35273427 未加载
MontagFTB大约 2 年前
So the bug where the first voting sheet shown to a user was from the same 10% of the photos turned out to be a feature, serving as a CAPTCHA of sorts to weed out the bad actors from the good.<p>If memory serves, some CAPTCHA techniques include showing two numbers to transcribe, where one’s value is already known. If that number is transcribed incorrectly, then the other number’s result isn’t used, and the CAPTCHA fails. Perhaps a similar technique may have also helped here?
评论 #35273308 未加载
评论 #35273462 未加载
评论 #35273315 未加载
churchill大约 2 年前
Oh, and Mark didn&#x27;t mention that Bola Ahmed Tinubu was indicted for heroin charges in the US in 2003, forfeited $460k &amp; is just too old to run a democracy this size.<p>Atiku Abubakar (second candidate) was a former VP and the president he served under (Obasanjo) still insists the dude remains a monument to corruption.<p>There&#x27;s been a coordinated campaign at all levels to rig this election massively and we saw voter intimidation, manipulation in broad daylight, and the acquiescence of foreign governments to it all.
评论 #35277361 未加载
评论 #35276644 未加载
评论 #35273634 未加载
评论 #35277127 未加载
评论 #35278693 未加载
评论 #35273783 未加载
kevviiinn大约 2 年前
Wow what a cliffhanger, it sounds like they have to deal with the courts now. I hope we get an update<p><a href="https:&#x2F;&#x2F;www.msn.com&#x2F;en-us&#x2F;news&#x2F;world&#x2F;opposition-files-petition-against-nigerian-election-result&#x2F;ar-AA18Uc7z" rel="nofollow">https:&#x2F;&#x2F;www.msn.com&#x2F;en-us&#x2F;news&#x2F;world&#x2F;opposition-files-petiti...</a>
mtrovo大约 2 年前
Is the access to the original photos open? It might be fit for a good Kaggle competition, although maybe a little too late for this current election.
评论 #35275868 未加载
评论 #35280378 未加载
davedx大约 2 年前
Incredible story.<p>Some more background: <a href="https:&#x2F;&#x2F;ng.usembassy.gov&#x2F;nigerias-2023-elections&#x2F;" rel="nofollow">https:&#x2F;&#x2F;ng.usembassy.gov&#x2F;nigerias-2023-elections&#x2F;</a>
dec0dedab0de大约 2 年前
This would have been a good use for hn style shadow banning. Especially if they didn&#x27;t publish the current tally, then the original easy to detect bots may have never realized you were on to them
rqtwteye大约 2 年前
I still don&#x27;t understand how we ended up with PDF as sort of standard to archive data. PDF is already pretty bad for things like manuals but for things like spreadsheets we basically collect the data, then we destroy all the structure by putting it in into POF, and later on we painstakingly try to restore the data from PDF which is often almost impossible to do with accuracy.<p>It just shows that bad solutions often win.
评论 #35275853 未加载
评论 #35277365 未加载
评论 #35274795 未加载
评论 #35278328 未加载
评论 #35275489 未加载
评论 #35274875 未加载
评论 #35278631 未加载
评论 #35277169 未加载
评论 #35278853 未加载
redman25大约 2 年前
This might be a sensitive question but I wonder if something like this would work in the United States? With all of the fears of election interference why not trust but verify?
评论 #35277209 未加载
评论 #35277521 未加载
评论 #35285216 未加载
harvey9大约 2 年前
This is some compelling writing. I know this has real life implications for real people so I hope it&#x27;s not in poor taste to say it would make a good movie.
评论 #35279080 未加载
davedx大约 2 年前
More background. OP is an impressive entrepreneur! Massive kudos. <a href="https:&#x2F;&#x2F;markessien.com&#x2F;projects&#x2F;hotels-ng&#x2F;" rel="nofollow">https:&#x2F;&#x2F;markessien.com&#x2F;projects&#x2F;hotels-ng&#x2F;</a>
tr33house大约 2 年前
I&#x27;d tried something like this with the Kenyan election but our setup was to use OCR (google cloud) -&gt; text -&gt; parse -&gt; sqlite<p>We started late so the results were out when we finished but I think it&#x27;ll be a good idea to develop software that can parse the PDF results and display them faster than the electoral bodies can. In Kenya, and Nigeria, the delays cause a lot of anxiety
hoseja大约 2 年前
Silly, you don&#x27;t malcount the actual votes, you brainwash the population and pervert the process until they vote the way you want them to, like in the advanced first world democracies.
评论 #35277547 未加载
mattlutze大约 2 年前
This was thrilling.<p>Sometimes, one person&#x27;s bug is another person&#x27;s feature :)
londons_explore大约 2 年前
Isn&#x27;t things like this the reason that the UN provide election observers?<p>By spot checking just a random 100 votes are correctly tallied, you can be pretty sure the outcome of the election is legit in a &gt; 10M voter country.
评论 #35273780 未加载
throwaway81523大约 2 年前
I&#x27;ve done stuff like this semi manually. Use pdftotext to get the text tables out of the pdf, eyeball it and massage with emacs keyboard macros, and in some cases python scripts. It&#x27;s not that big a deal but it is somewhat ad hoc.<p>I know that OCR software is able to read stuff like magazine articles and figure out column layout, embedded charts, etc. It&#x27;s weird if is nothing to do that with a pdf. Maybe I&#x27;ll look around or see if I can hack up something.
评论 #35277449 未加载
hardlianotion大约 2 年前
That is a great job - well done from a grateful Nigerian.
评论 #35274892 未加载
mmmuhd大约 2 年前
Elupee 75, To be frank, you did a great job and i am proud of someone from my country pulling this off, but the bitter truth is President Elect Bola Ahmed Tinibu won this election. Peter Obi&#x27;s youth support is predominantly in the south, and Christian majority parts of the country, he clearly lack support in the Muslim north, where I am from. I voted for Kwankwaso though.
评论 #35273808 未加载
评论 #35273626 未加载
YeGoblynQueenne大约 2 年前
&gt;&gt; We had a brainstorming meeting, and decided to try a new approach. We would simply ask the Obidients to help us do the conversion. If hundreds of Obidients did the transcription, it would go fast.<p>What would guarantee that the Obidients would not, in turn, try to inflate the score of the Labor candidate?
评论 #35277783 未加载
seventytwo大约 2 年前
Wow, this was a fantastic read!<p>I have no idea what’s going on in Nigeria, but I hope the truth (whatever it is) will prevail!
nivenkos大约 2 年前
This is a great example of why electronic voting is important and can help secure democracy.
评论 #35273769 未加载
评论 #35273674 未加载
评论 #35275417 未加载
评论 #35279104 未加载
jxramos大约 2 年前
&gt; Then ominously, on the 20th of October of 2020 some people drove there in unmarked cars and removed all the Cameras installed at the tollgate.<p>They at least capture some photos of the equipment. I wonder if anyone communicated with the individuals.
pjc50大约 2 年前
Striking reminder of how big the world is that while I had heard of #EndSARS, I hadn&#x27;t realised the scale of the political violence in Nigeria nor that it had its own Bloody Sunday-scale massacre.
londons_explore大约 2 年前
The votes surprise me... In many regions one party gets 90+% of the vote.<p>Assuming the numbers are correct, then it suggests that most people are easily swayed by their local peers.<p>Is that common in say the USA?
评论 #35275613 未加载
评论 #35275578 未加载
评论 #35273818 未加载
blntechie大约 2 年前
What was the final result numbers from the transcription?
评论 #35273160 未加载
thread_id大约 2 年前
Fantastic story. What an excellent example of democratization from technology. And also a perfect example of how the blade cuts both ways. Digital warriors battling it out in real time and the stakes are enormous. Great respect for Mark and his ingenuity and adaptive responses!!!!
pxc大约 2 年前
I&#x27;m impressed by the courage of the protesters here, and the tenacity of the youth voters.<p>I hope they get a clear answer and a fair count, and whether they win this time or not, a real shot at cracking up their corrupt, two-party system.
neves大约 2 年前
Is it true that USA does not have a open data law to make everybody publish in CSV?
评论 #35277747 未加载
SergeAx大约 2 年前
Pdf is a very unfortunate format. It is proprietary, it is paper-oriented, its almost single goal is to keep precise printing layout. But for the last 30 years world didn&#x27;t come up with anything that could compete.
评论 #35273921 未加载
orf大约 2 年前
Fantastic story! Did the results get used in a claim?
vincheezel大约 2 年前
I hope for (but do not expect) a positive outcome
jgtrosh大约 2 年前
The context should be dated to 2020, not 2023 Edit: it was now corrected, no need to downvote<p>Great story! Looking forward to some follow up
评论 #35273467 未加载
dejongh大约 2 年前
Wow. Wild story. Thanks for sharing. Cool twist that a bug ended up identifying the bad guys.
clipper_janosch大约 2 年前
What an exceptional story. You are a legend.
prhrb大约 2 年前
What a scam by the ruling political party
roschdal大约 2 年前
The people who cast the votes don&#x27;t decide an election, the people who count the votes do. - Stalin.
评论 #35275913 未加载
snvzz大约 2 年前
Not providing CSV is at the level of criminal negligence.
churchill大约 2 年前
-
评论 #35273742 未加载