Writing your own C++ standard library from scratch

146 pointsby JNRoweabout 2 months ago

17 comments

bregmaabout 2 months ago

He's not writing the C++ standard library from scratch. He's writing his own library, in a different namespace, with some similar functionality. It's easy to write a non-standard library that only satisfies your limited subset of needs. People do it all the time. It's not special.The ABI stability boast is based on having no legacy to support. It will work fine as long as everything is shipped only as code and the whole world needs to be rebuilt from scratch every time and even then, you never change any code ever even for a major bugfix. That's hardly practical in the real world where one tiny misstep on the ABI front can result in billion-dollar multinationals threatening suit (ask me how I know). It's a facile claim.The C++ standard library hasn't been known as "the STL" for almost 30 years, ever since part of the STL was modified and adopted into the C++ standard library. Most of the features he's providing implementations for were never a part of the STL (file I/O, strings, hash maps, UTF-8).I maintain an implementation of the C++ standard library for a living. It's a full-time job. It's a huge library (note to the committee: please stop) and it's really easy to mess something up. But if you want to write your own library that doesn't do what the standard library does or meet any of its requirements and implementation constraints or serve its real-world purpose, go right ahead. Just don't claim you're writing your own C++ standard library. You're not.

评论 #43475029 未加载

评论 #43470837 未加载

评论 #43475725 未加载

评论 #43469917 未加载

评论 #43473327 未加载

leni536about 2 months ago

The section about the "perfect ABI stability" is rather naive. If you have a 3rd party library that exposes a class like this in a header:<pre><code> class SomePublicClass { pystd::HashMap<pystd::U8String, size_t> member; /*...*/ }; </code></pre> and distribute that 3rd party library as compiled against a particular pystd version and the headers, then that build is tied to one particular "epoch" or version of pystd, you can't safely link that library against a program that uses a different "epoch" of pystd.It's also not a new idea either. libc++ puts everything inside an inline namespace `std::__1`. There is a reason that they never bumped that.

评论 #43473863 未加载

评论 #43476291 未加载

评论 #43469561 未加载

fefe23about 2 months ago

The title is confusing. He is not reimplementing the STL. He is writing some C++ classes providing functionality that is also already implemented in STL.

评论 #43469181 未加载

BonusPlayabout 2 months ago

A problem I encountered while writing custom stdlib, is that certain language features expect stdlib to be there.For example, <=> operator assumes, that std::partial_ordering exists. Kinda lame. In the newer C++ standards, more and more features are unusable without stdlib (or at least std namespace).

评论 #43469466 未加载

评论 #43469714 未加载

grandempireabout 2 months ago

There is a science to designing reusable containers and algorithms, and it’s based on research like Art of Computer programming and you can learn more by reading primary sources about the design of STL.STL can absolutely be improved, but posts like this indicate most programmers are clueless about how it works, and not in a position to learn from its mistakes and make something better.If we are serious about code reuse we need to study these ideas and learn how to actually write libraries. The alternative is the npm/crates model - where you throw together 100 different open source concoctions and hope it works.

unwindabout 2 months ago

Very nice! I like the tone and flippant energy of the post, of course, and also the way to get a nice scope by having a concrete case of a program to implement.I also appreciated the comparisons against STL, very informative. It's ... interesting that if including `vector` in STL brings in 27,000 lines, and the author's implementation of the functionality for the example program was only 1,000 lines, that the compilation time difference is only 4X. Not sure I understand that, really. But benchmarking is hard, of course.If I could come with a single suggestion it would be to include the sample program's source as text, not as a picture of text. If that means losing the pretty syntax highlighting, that's fine (by me). :)

评论 #43469571 未加载

dvhabout 2 months ago

I unexpectedly did some cpp few days ago and I was surprised that cpp standard library doesn't have string trim function! Everybody is rolling their own. What is the reason behind that?

评论 #43469688 未加载

评论 #43469852 未加载

评论 #43470246 未加载

评论 #43469812 未加载

评论 #43470848 未加载

评论 #43470452 未加载

quibonoabout 2 months ago

A bit off-topic maybe, what is a good open source library to read through to see some clean&modern C++? I've not dealt with the language in a bit and was thinking of diving back in

评论 #43470201 未加载

评论 #43470417 未加载

germandiagoabout 2 months ago

A bit off-topic, but as a Meson user, I would love to see C++ modules support since they start to be usable in all three big compilers.Nice experiment for the pyStd, though, as pointed out, this would break with pre-compiled 3rd party deps that use pystd in a different version :)

daeminabout 2 months ago

I do wonder how much smaller the STL source code would be if it was pre-processed or written with only a single C++ standard in mind. So only for C++20 or only for C++23 etc. In that case how much faster would things be to compile where it doesn't need to filter through hundreds of preprocessor options?

评论 #43469338 未加载

评论 #43469719 未加载

larussoabout 2 months ago

How will this year release scale when you need to work with newer compilers? I don‘t write cpp so I honestly don‘t know. Do you need to freeze your version of the compiler forever? Or is gcc / clang backwards compatible? Or do you sprinkle tons of pragmas on the files to control this? What I mean is how can you make sure your version one API is still compiling in the future. Take the counter example of ruby for instance. I could write a package with lots of namespaces and declare v1 frozen. But I still need to potentiallyupdate the code so it can run in newer versions of the runtime. Edit: typos

评论 #43475226 未加载

usrnmabout 2 months ago

Years ago every C++ project worth its salt had at least one implementation of of strings, vectors and other basic things. I hoped we finally were past that.

评论 #43469653 未加载

评论 #43471004 未加载

评论 #43469590 未加载

评论 #43471586 未加载

atombenderabout 2 months ago

> The C++ standard library (also know as the STL)The C++ Standard Library is not the same as the STL.The STL is the Standard Template Library, which provides containers such as vectors, as well as related functionality like iterators.The C++ Standard Library includes STL, but is a lot more, including things like I/O, math, concurrency, and so on.

评论 #43470327 未加载

singularity2001about 2 months ago

I did that to get minimal wasm binaries, not sure if tree shaking today would sufficiently shrinks c++ wasm apps.

zerrabout 2 months ago

I recall years ago there was some kind of competition in implementing full C++ parser/front-end and standard library, or something like that, allegedly organized by nvidia.

评论 #43472556 未加载

cozzydabout 2 months ago

Is there an explanation for the py prefix? Is that some sort of cruel joke?

SleepyMyroslavabout 2 months ago

Rant. I do not see any improvement in the outcome code. Lets not nitpick on fast parsing and just scroll through unnecessary code actions.> ... u8line(move(line))We are not reusing parsed line object between iterations. Forcing fresh allocation per line.> auto words = ...Fresh allocation per line.> lookup/insertLookup and hashing done 2 times for each word. Each unique word individually allocated on the heap.> stats.push_backNot preallocated. Likely doing full allocate + copy per each word.> sort_relocatableCould have been faster with additional memory provided. But this is minor because sorting probably was not ideal in the first place.and the icing on the cake:>printf("%d ... (int)count ...As old saying goes "One can write Fortran program in any language". There are zero reasons to write non type safe text output in 2025 in C++ but here we are.TLDR. One can name their foundation library any name and use any namespace it does not change how the code written much. Right?