TechEcho

Stuff Your Logs

118 points by r4um about 4 years ago

4 comments

ChrisFoster about 4 years ago
This is really neat. In the past I've used similar techniques to decode binary data from a third party lidar system in parallel. In a way that the manufacturers probably didn't intend or expect.

The system generated large data files which we wanted to process in parallel without any pre-indexing. It turned out that these streams contained sync markers which were "unlikely" to occur in the real data, but there wasn't any precise framing like COBS. Regardless, the markers and certain patterns in the binary headers were enough to synchronize with the stream with a very high degree of reliability.

So for parallel processing we'd seek into the middle of the file to process a chunk of data, synchronize with the stream, and process all subsequent lidar scanlines which started in that chunk. Exactly the algorithm they describe here.

Amusingly this approach gave reasonable results even in the presence of significant corruption where the manufacturer's software would give up.
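
For readers who want to see the chunked approach concretely, here is a minimal Python sketch of "seek to a chunk, synchronize, and process every record that starts in that chunk". It is not the lidar vendor's format or the article's code: the two-byte `SYNC` marker and the function names are hypothetical, the data is held in memory rather than read with seeks, and the sketch assumes the marker never appears inside a payload (which byte stuffing would guarantee, but which was only "unlikely" in the lidar case).

```python
SYNC = b"\xfe\xfd"  # hypothetical sync marker, assumed absent from payloads


def records_starting_in(data: bytes, start: int, end: int) -> list[bytes]:
    """Return payloads of records whose marker begins in data[start:end)."""
    records = []
    # Synchronize: skip forward to the first marker at or after `start`.
    pos = data.find(SYNC, start)
    while pos != -1 and pos < end:
        # A record runs until the next marker, even if that lies past `end`.
        nxt = data.find(SYNC, pos + len(SYNC))
        stop = nxt if nxt != -1 else len(data)
        records.append(data[pos + len(SYNC):stop])
        pos = nxt
    return records


def process_in_chunks(data: bytes, chunk_size: int) -> list[bytes]:
    """Visit fixed-size chunks independently; each could go to its own worker."""
    out = []
    for start in range(0, len(data), chunk_size):
        end = min(start + chunk_size, len(data))
        out.extend(records_starting_in(data, start, end))
    return out
```

Because a record belongs to the chunk in which its marker starts, no record is processed twice or skipped, whatever chunk size is chosen.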
c0l0 about 4 years ago
Having skimmed the article (because $dayjob and all), I wonder how/if their scheme can cope with write(2) producing a short write, with not all the data in the buffer being atomically committed to their POSIX-compliant backing store?

I don't see any mechanism described that makes sure that never happens (by forcing records not to exceed a given length that can always be written atomically - which I am not sure even exists...), so I am wondering how often that kind of thing even happens on contemporary systems, and - if it does - how often they wreck a good number of stored records that way.
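
To make the short-write concern concrete, here is a small Python sketch of the usual retry loop around `os.write`, which, like write(2), may accept fewer bytes than it was given. The helper name `write_all` is hypothetical, and this is not a mechanism taken from the article: the loop only ensures that all bytes are eventually handed to the kernel. It does not make the record atomic, so a crash or another writer between two calls can still leave a partial or interleaved record in the file.

```python
import os


def write_all(fd: int, buf: bytes) -> int:
    """Call os.write until all of buf is accepted; return the number of calls."""
    view = memoryview(buf)  # lets us slice off written bytes without copying
    calls = 0
    while view:
        written = os.write(fd, view)  # may be less than len(view)
        view = view[written:]
        calls += 1
    return calls
```

A return value greater than 1 means at least one short write occurred for that record.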
codetrotter about 4 years ago
This is a wonderful article! This is exactly what I need, I can’t wait to implement this! Thank you both author and person that posted this ^^
mef about 4 years ago
site appears to be down... archive link: https://archive.ph/https://pvk.ca/Blog/2021/01/11/stuff-your-logs/