Again on 0-based vs. 1-based indexing

186 pointsby Ivoahover 4 years ago

85 comments

One thing I would hope there is consensus on is that 1-based indexing is easier to learn. My son is doing Python at school, and the "first-element-is-actually-element-zero" is something that has to be dinned into them, a sure sign that it is non-intuitive. In similar vein, as adult programmers we know that the half-open range is the most useful mechanism for expressing a sequence, but again we have to explain to kids that if they want the numbers 1 to 12 (eg to do a times table program) they must type range(1,13), which at that stage of their learning just seems bizarre. Actually I could go on at length about why Python is a terrible teaching language, but I'll stop there !

评论 #25846089 未加载

评论 #25845007 未加载

评论 #25844849 未加载

评论 #25846094 未加载

评论 #25846678 未加载

评论 #25844976 未加载

评论 #25845237 未加载

评论 #25847004 未加载

评论 #25853168 未加载

评论 #25845924 未加载

评论 #25852971 未加载

评论 #25847264 未加载

评论 #25849492 未加载

评论 #25855654 未加载

评论 #25846919 未加载

评论 #25846997 未加载

评论 #25847553 未加载

评论 #25846683 未加载

评论 #25845581 未加载

评论 #25845310 未加载

samatmanover 4 years ago

0-based indexing with closed intervals is better for slicing. This shouldn't be controversial. It's because you can represent a zero interval cleanly: [3,3) is an empty interval after slot 2, representing a single cell is [3,4).This has two nice properties. One is that two slices are adjacent if the beginning and ends match, and the other, far more important, is that the length of the slice is end - start.That's the one that really gets us something. It means you can do relatively complex offset math, without having to think about when you need to add or subtract an additional 1 to get your result.I use Lua every day, and work with abstract syntax trees. I mess this up all. the. time.Of course you can use closed intervals and stick with 1-based indexing. But for why you shouldn't, I'm going to Appeal To Authority: read Djikstra, and follow up with these.<a href="https://wiki.c2.com/?WhyNumberingShouldStartAtZero" rel="nofollow">https://wiki.c2.com/?WhyNumberingShouldStartAtZero</a> <a href="https://wiki.c2.com/?WhyNumberingShouldStartAtOne" rel="nofollow">https://wiki.c2.com/?WhyNumberingShouldStartAtOne</a> <a href="https://wiki.c2.com/?ZeroAndOneBasedIndexes" rel="nofollow">https://wiki.c2.com/?ZeroAndOneBasedIndexes</a>

评论 #25844616 未加载

评论 #25844761 未加载

评论 #25844892 未加载

评论 #25845357 未加载

评论 #25844705 未加载

评论 #25845414 未加载

评论 #25848128 未加载

评论 #25846253 未加载

评论 #25847803 未加载

评论 #25845000 未加载

tzsover 4 years ago

In mathematics, both are common. For example, when working with polynomials or polynomial-like things, it is common to label coefficients starting with 0. E.g., a0 + a1 x + a2 x^2 + ...The subscript on the coefficient then matches the power of x, letting you write the general term as ai x^i, which works out great when using capital sigma notation.One the other hand, matrix rows and columns usually start from 1, so the elements on the top row are usually a11, a12, a13, ..., and the next row is a21, a22, a23, ..., and so on.In an attempt to bring some unity to the two sides on this issue in programming, let me offer something that I'm sure everybody will be able to agree on.Once upon a time I was implementing some mathematical code in C++. It was a mix of things from places where mathematicians would number from 0 and where they would number from 1.I decided that the code would be clearer if the code looked like the formulas in the math papers they came from, which meant I wanted to use 0-based arrays in some places and 1-based in others.Solution: I overloaded the () operator so that array(i) was a reference to array[i-1]. Then I could use 0-based or 1-based, depending on whether I was using a formula that came from a 0-based or 1-based area of mathematics.Everybody agree that this was not a good idea?

评论 #25848313 未加载

评论 #25852228 未加载

评论 #25859535 未加载

ajucover 4 years ago

TBH indexing became much less important with every language adopting some sort of "for all" and map/filter/reduce constructs. If you don't care about indexes you don't need to think about them (finally!).The remaining cases are by definition edge cases and warrant enough attention that bugs caused by 1- vs 0-based indexing doesn't seem to be a big problem in practice.It's like with goto and structured programming - people stopped using goto for loops and ifs, so the remaining cases where goto is used as last resort aren't much of a problem. People think hard before doing this.

评论 #25845922 未加载

评论 #25845081 未加载

评论 #25844185 未加载

评论 #25844862 未加载

kstenerudover 4 years ago

I was hoping for an interesting analysis of 0 vs 1 based indexing, but instead got a 3 page rant of HN-this and appeal-to-authority-that which adds absolutely nothing of value to the discussion. It feels more like a bottom-of-the-page angry comment to an article than an actual article.

评论 #25844715 未加载

评论 #25843799 未加载

评论 #25845950 未加载

评论 #25842957 未加载

评论 #25848115 未加载

评论 #25846166 未加载

chmod775over 4 years ago

Here's two arguments arguments in favor of 0-based 'indexing' (or offsets), that aren't just "because that's how it's done":1. It is faster. Due to how memory works, 1-based languages need to subtract 1 internally every time you access an array[1].2. It works mathematically better with some of the most common operations on array offsets, like modulo and division into round/ceil/floor. You'll be peppering your code with +1 or -1 around those a lot if you use 1-based indexing.[1]: This is lua, for instance: <a href="https://github.com/lua/lua/blob/master/ltable.c#L702" rel="nofollow">https://github.com/lua/lua/blob/master/ltable.c#L702</a>

评论 #25843031 未加载

评论 #25844310 未加载

评论 #25843320 未加载

评论 #25845874 未加载

评论 #25843968 未加载

评论 #25844328 未加载

评论 #25843397 未加载

GolDDranksover 4 years ago

> It really shows how conditioned an entire community can be when they find the statement “given a list x, the first item in x is x[1], the second item in x is x[2]” to be unnatural.It really shows how conditioned the mankind is that they call the "0th" ordinal number by the name "first". We have never moved past the phase where the concept of "zero" was heretic.Here + offset makes SO much sense. I think we should move to use it in other contexts too. "0 steps from the start of an ordered list" "1 step from the start of an ordered list" etc.At the moment we are using two different "scales" for measuring things and labeling orders. We could as well use the letters "A", "B", "C" for the latter, and it wouldn't change a thing. Hindsight is 20/20, but it's just a confusing mishap that we are using "numbers" and "numbers + 1" for the two purposes, where the second one could be expressed just as another "measurement", and we could get rid of the confusing "numbers + 1" scale.

评论 #25844947 未加载

评论 #25844883 未加载

评论 #25844919 未加载

评论 #25845913 未加载

评论 #25845476 未加载

评论 #25845742 未加载

DanielBMarkhamover 4 years ago

Disclaimer: I don't care about this argument one way or another. However I found the author missed the point (as perhaps many commenters did?)"...nowadays, all arguments that say that indexes should be 0-based are actually arguments that offsets are 0-based, indexes are offsets, therefore indexes should be 0-based. That’s a circular argument..."Yes, that is a circular argument. It's not the one I would use. C made the decision that indexes and pointers are the same thing. It's a logical argument and one that's not circular. If they're separate things, then yes, you end up in a circle.For languages that are not C-like, you could argue "pick an idiom and stick with it across multiple languages" or "make programming languages easier for humans to understand" Both of those arguments are preferential arguments. Perhaps there could be a study comparing the merits of each, but I haven't seen one yet.I like chocolate ice cream. I like making pointers and arrays as similar as possible. I like being able to collapse pointer arithmetic down. I like doing huge mallocs and then playing around with large, empty hunks of memory. Some folks don't. I get it. Code should look like the way we think about the problem. That's an ideal state we'll never reach, but it's worthy of continued discussion.

评论 #25845036 未加载

lifthrasiirover 4 years ago

Do not add any more fuel to the flame and instead use 2-based indexing: <a href="https://github.com/simonster/TwoBasedIndexing.jl" rel="nofollow">https://github.com/simonster/TwoBasedIndexing.jl</a>Seriously, the exact value of the lower bound for indexing doesn't matter here (some algorithms are best described with the lower bound other than 0 or 1, for example). The fixed or preferred lower bound is the real problem. Any argument for/against 0-based and 1-based indexing tends to gloss over the real problem because those arguments only exist to make some languages look better than other languages. As we move away from forced explicit indexing (e.g. arr.first() or foreach instead of arr[$LBOUND] or `for (i=$LBOUND; ...)`), it becomes clear that there is no such thing like the preferred lower bound for sequences at all.

评论 #25847330 未加载

nabla9over 4 years ago

Just like big- vs. little endian issue, the differences matter less than everyone adopting one. When you program having two code sets with different base of indexing is several times worse than either alone.0-based indexing is here. 1-based has no benefit. New languages should use 0-based indexing.

评论 #25842900 未加载

评论 #25844213 未加载

评论 #25843753 未加载

评论 #25842873 未加载

评论 #25844376 未加载

tincholioover 4 years ago

I kinda take issue with his complaining about referring to Dijkstra's argument as being an argument from authority. Dijkstra makes a pretty solid case in his writing, it's not just about him being Dijkstra.

评论 #25844375 未加载

评论 #25849750 未加载

bayindirhover 4 years ago

While I think these debates healthy (when they're civil and well grounded), found out that I don't have any preferences for one over other for things like array indexing.Every language has its own design decisions based on some requirement or opinion, and while I have rather strong preferences about languages, these kind of design decisions doesn't strike any nerves.As long as it's usable, that's fine by me.

thayneover 4 years ago

There are many situations where 1-based indexing is more natural. There are also many situations where 0-based indexing is more natural. And also many (possibly most?) situations, where it doesn't really matter.My 2 cents: it isn't really that important. As long as the developer is aware of whether the language they are currently using 0 or 1 based indexing.

评论 #25848757 未加载

grawprogover 4 years ago

For a general purpose programming language 1-based indexing probably causes more confusion and errors than benefits. A language trying to be a general purpose computer programming language shouldn't abstract in a way that's likely to lead to confusion with those that understand the fundamentals.For a domain specific language focused around simulating physical things, or a specific application, 1-based indexing may be the more appropriate option. In this case, abstracting the problem domain is likely more important than adhering to the limitations of hardware.

评论 #25842969 未加载

recursiveover 4 years ago

Linking to Dijkstra does not have to be an appeal to authority. I happen to find his position to be persuasive on its own merits.

评论 #25843084 未加载

tobrover 4 years ago

I don’t have any experience working with 1-based indexing, but it seems reasonable. I certainly remember how confusing 0-based indexing felt for a long time when I first learned to program.On the other hand, I’ve made an observation about timelines in music production software, where bars are counted starting from 1. As you zoom out, only every 4th bar tends to be labeled, which creates this strange sequence: 1, 5, 9, 13, 17… I always found that incredibly hard to reason about, and perhaps it would have been a better idea to label the first bar as 0.

评论 #25845149 未加载

ben509over 4 years ago

0-based indexing is preferable because the mathematics of half-open intervals make composing ranges far simpler. It becomes a bit more obvious as soon as you do any kind of non-integer ranges, because you find that only integers have a reliable "subtract one" operation.For instance, suppose you're doing a lot of date arithmetic, and you might include times. If I have a range:<pre><code> [2005-05-05, 2007-07-07) </code></pre> This unambiguously includes all times. If I compose it:<pre><code> [2005-05-05, 2007-07-07) [2007-07-07, 2008-08-08) </code></pre> If I shift all of them forward 10 days:<pre><code> [2005-05-15, 2007-07-17) [2007-07-17, 2008-08-18) </code></pre> Every possible time within the new range is accounted for. Transformations on the ranges are obviously correct. If we can't agree on whether time resolves to seconds, milliseconds or nanoseconds, it doesn't matter.That's because you can find the beginning and end of a range without knowing how to subtract by one. If you later decide to add times, it will just work because, conceptually, you already handle every possible value in the range.If you want to construct a series of ranges of floats, it's a bad idea to do [3.457, 6.799999], [6.8, 12.47699999]. If you use half-open intervals, it just works, without mucking with the end of the range.This is the reason that "indices are offsets" is powerful: it's not circular reasoning, but a simplification because we're removing an unnecessary primitive (subtracct 1) that only reliably works with integers.And to bring this back to integer offsets, even when you're working strictly with integers, you may want to conceptually work with rational numbers.When you're working out your math for buffered IO, you need to map byte indices to chunk indices. You need to assign every byte written to a chunk.In a 0-based world, your chunk offset is simply your byte offset / chunk length. Because there is no distinction between indices and offsets, it's a clean mapping between one offset scheme and the other.It means you can work out on paper when a new chunk should be started, and then translate that into integer arithmetic. And your code directly reflects the math, rather than having to stick "- 1" and "+ 1" in here and there.

steerablesafeover 4 years ago

> It really shows how conditioned an entire community can be when they find the statement “given a list x, the first item in x is x[1], the second item in x is x[2]” to be unnatural.It's not unnatural, but it's not natural either. It's just that 1-based indexing aligns with natural language conventions better. It also matches with some math conventions for matrices and vectors, but those don't have to use 1-based indexing either to work. I learned linear algebra with abstract "index sets" (I), where I can be {1, 2, 3, .., n}, {x,y,z}, {0,1,2}, whatever.Programming in 0-based indexing means way less +-1 adjustments on boundaries. Division/modulus also works way better with 0-based indexing, so flattening multi-dimensional arrays is more easily expressed too.In the end it's just a convention, it shouldn't matter too much. My money is still on 0-based indexing, as it lends itself to less errors and cognitive load when you get used to it. I used both extensively.Tangential: conventions that are more "natural" are usually not actually natural, but just align themselves to other conventions better. Of course it's always good if conventions align, but sometimes it's inconsequential. Also there are situations where we can't make all conventions align.An other little pet-peeve of mine is people calling big-endian more natural than little-endian, where there are actually different competing conventions:1. Usual way of printing memory content from lower address to higher address from left to right (think hexdump).2. Usual way of writing down numbers from left to right from most significant digit to least significant digit.3. Expressing the value of the number from the byte array elements (\sum_0^(width-1) byte[n] 256^n for little-endian, \sum_0^(width-1) byte[n] 256^(width-n-1) for big-endian)Big-endian aligns better with the combination of 1 and 2. Little-endian aligns better with 3.

评论 #25844535 未加载

评论 #25845101 未加载

ginkoover 4 years ago

One issue I see with 1-based indexing is that it makes x[0] undefined, so even if you pass an unsigned integer index to a function you need to verify that it's != 0 before you use it. So in a way it'd be the whole null pointer thing for array indices all over again.Another minor issue is that you can access one less element with a fixed size integer than when you do 0-based indexing. So for u8 you can only access 255 elements instead of 256 for instance.

评论 #25845826 未加载

评论 #25845801 未加载

评论 #25845890 未加载

LandRover 4 years ago

It's range that gets me all the time, some make the end inclusive, some exclusive.<pre><code> Clojure (range 1 10) ;=> (1 2 3 4 5 6 7 8 9) C# Enumerable.Range(1, 10).PrintAll(); 1 2 3 4 5 6 7 8 9 10 Ruby (1..10).each { |n| puts n } #=> 1 2 3 4 5 6 7 8 9 10 Python for i in range(1, 10): print(i, end=', ') => 1 2 3 4 5 6 7 8 9</code></pre>

评论 #25846135 未加载

dnauticsover 4 years ago

This is obviously spaces vs tabs third rail in PL design, but riffing off of the language in the article, probably we should all agree on two opinions:0-based indexing is inherently better for machines.1-based indexing is inherently more natural for humans.**For a short stint I was running a (small) datacenter, and despite being an apologist for 1-indexed programming languages, I zero indexed EVERYTHING in the datacenter.Let me just say, that was a huge mistake. Physical items are not offsets. Even though I was then working in a 0-indexed PL, The amount of contortion I had to do to remember zero indexing made the likelihood of errors higher (I did not make any that weren't quickly recoverable, but still).

评论 #25850150 未加载

评论 #25851015 未加载

kevin_thibedeauover 4 years ago

With 1-based indexing, you can't index the last element of an array that occupies the entire domain of the index type.

评论 #25844312 未加载

评论 #25844262 未加载

评论 #25844520 未加载

jkingsberyover 4 years ago

I know Visual Basic gets a bad rap, but as a learning language it had an interesting feature, `Option Base`. By putting `Option Base` in a module, it changed how the indexing of arrays worked. It defaulted to 0, but for some applications (and also, when you're first learning), 1 can be convenient.Of course, there are problems with this in a professional setting, such as how do you enforce uniformity across modules, and what happens if you copy code from one module that's Base 0 to one that's Base 1 and vice versa. But when I was first learning how to program, it was helpful to me to have a language that allowed for some choice.In the meantime, 23 years of programming have led me to believe that index base 0 makes sense. For many applications it's moot, because we should be using higher level functions (like map and reduce) for processing lists. In every other application (such as working with data on a grid), dealing with offsets does make things easier.Perhaps the convention is arbitrary. But, lots of industries have arbitrary conventions that we all agree on just to aid communication, and I disagree with the original author that the term "groupthink" applies in this situation.

stkdumpover 4 years ago

While 1-based index is probably easier to learn and understand for a beginner, even as a beginner there is quickly an overhead where you have to adjust indices by 1 all the time as soon as you do any math on indices. Going from Basic to C was hard for me for many reasons, but a lot of complexity in my code that I took for granted just vanished. This truly showed me the power in getting such fundamental choices right.

rufflezover 4 years ago

Why is this a thing?? Seriously, every language has its rules...just follow the rules and build something useful

评论 #25842997 未加载

tsegratisover 4 years ago

What convinced me is graphs' zero originWhen counting apples (i.e. only +) then 1 based. but when (* % ^ √ ÷) then 0 based

robertlagrantover 4 years ago

Rebutting the dismissal of offsets as an artefact of an old language:- pointer offsets - the article seems to dismiss these, but they're used loads in systems/network/driver programming, which is a reasonable chunk of the most important programming- conceptual offsets - e.g. time of day starts at zero, not one. We think in this way a lot, and it's useful.

mhandleyover 4 years ago

For things that wrap, such as ring buffer indices or packet sequence numbers in a fixed-size field or clocks (hour of the day, minutes in an hour), zero-based indexing is simple modular arithmetic. Wrapping of one-based indices can of course be done, but it's more complicated than necessary.

评论 #25846757 未加载

Yizahiover 4 years ago

Even worse is sometimes this 0-based numbering is carried over to a world of enumerable things, cargo cult like. In my project we have multiple physical entities which have numbers, e.g. blades, ports, channels on so on, all of them are counted from 0 because some genius decided to write that into standard. And now there are multiple points in the code where 1-based numbers are converted into 0-based and back, and incredibly - there are bugs there :) . Talking to humans gets equally weird and confusing when you are talking about second port numbered 1, it's just never gets natural, even after years working on these devices.

gabereiserover 4 years ago

Here’s some fuel to the flames: < is less chars than <= , it also is more logical. 0-based indexes have the advantage of if in comparison and range. Mathematics treats 0 as special and can have all sorts of side effects if dropped into a formula, code doesn’t have this side effect (unless running formula as an algorithm, which is math).To save yourself from headaches, I believe 0-based indexes are preferred in almost every modern language for the simple reason of optimization. I have no evidence to back this. NULL=Nothing=0 is a thing though.

mangecoeurover 4 years ago

Honestly 0-based indexing is one of the most annoying things to teach. In sciences, people really don't care about programming arcana: I have never found anyone who found 0-based indexing natural or intuitive.In fact, much of the vehemence in favor of it seem to be more about programmer shibboleths, to make some people feel they are in a special 'in-the-know' club (certainly if the r/programmerhumour reddit is anything to go by).All I can do for scientists is tell them the advantage of learning a programming language outweighs the weirdness.

评论 #25848383 未加载

sfgweilr4fover 4 years ago

Well... I wrote a snake game in python and lua.0 based array (python) : screen_pos = x + y * width1 based (lua) : screen_pos = 1 + x + y * widthshrugSo 0 based seems to work a little better for arithmetic based array references. But really you'll never resolve this to perfection. Assembly language programmers are very unlikely appreciative of 1-based.Its all about what you're comfortable with.Those wanting pointers can imagine a base address added to each calculation. 1-based then needs a -1... which is then a strange thing to advocate for.

numlock86over 4 years ago

> Again on 0-based vs. 1-based indexingThe problem is within the phrasing and its premise: Are you using an index or an offset? 99% of the time I think of an offset, so "first" makes sense as "0". If you go by indexing "1" suddenly makes sense as an index. When people say index in the context of arrays or data structures usually they mean an offset. To me it's simply wrong phrasing.

评论 #25844972 未加载

epageover 4 years ago

> You see, Lua uses 1-based indexing, and lots of programmers claimed this is unnatural because “every other language out there” uses 0-based indexing.> I’ll brush aside quickly the fact that this is not true — 1-based indexing has a long history, all the way from Fortran, COBOL, Pascal, Ada, Smalltalk, etc. — and I’ll grant that the vast majority of popular languages in the industry nowadays are 0-based. So, let’s avoid the popularity contest and address the claim that 0-based indexing is “inherently better”, or worse, “more natural”.Maybe I missed the comments about 1-indexing being "unnatural" but if the author is referring to the discussion I took part in, then this is a strawman. It wasn't about what was "natural" or "unnatural" but about people switching from their primary language to secondary languages and gotchas like 1-indexing being error prone.We had a platform team at my last company looking to adopt Lua for client customization. The primary authors became familiar with Lua in writing the code logic but everyone else would be touching their part, or contributing back to the core, not as people familiar with Lua but as C++ developers who would be blindly making changes in another language. It felt similar to maintenance of our Perl scripts. You don't brush up and become an expert on a language you interact with on a yearly cadence. It is important in these cases to have few gotchas to make casual contributions easier and safer.This says nothing about using 1-indexing when your target audience isn't 0-indexed programmers (speaking of those who choose to use Lua and not to Lua's creators).Unfortunately, for me, even with my complaints (this and language compatibility), I'll probably still use Lua for some projects of mine. I've looked at others and I'm mainly concerned about the community size.

psychoslaveover 4 years ago

Honestly, I feel like `some_list.first` is the most ergonomic.And if the API provides `second`, `third`, `antepenultimate`, `penultimate`, and `last` methods it covers a large part of what one most often use.Past third rank, then I will feel better served with a `slice` method, whose documentation provides explicitly it’s index convention, whether it makes a modulo on out of bounds, etc.

oconnor663over 4 years ago

Has this argument been rehashed much:If you use 1-based indexing, you can't iterate over a list of maximum length with a simple loop. The normal loop condition is `while i < len` (0 based) or `while i <= len` (1 based). If len is maximal, the second one probably has an overflow bug in the loop body that turns it into an infinite loop or a crash.

评论 #25845176 未加载

评论 #25842931 未加载

oiveyover 4 years ago

In the same way that I don’t think learning sklearn is the hard part of knowing ML, I don’t think 0 or 1 -based arrays is the hard part of any particular language. There are situations where both are appropriate. The choice of either as the deal breaker for a language is extremely superficial.

emn13over 4 years ago

I had a ROFL moment when he discredited appeals to authority, and then progressed to note that it's conventional math notation - which is pretty much the same kind of logic.Furthermore, mathematical notation is almost universally terrible. It's inconsistent, ambiguous, and has many undeclared dialects. Oh, and many people use 0-based index to boot, because, you know, whether that's conventional or convenient (largely orthogonal qualities alas in math) depends on the context.I mean, I can sort of buy his argument that it's arbitrary, but then he also points out that there's at least one advantage to 0-based indexing, namely that works well in offset based scenarios (not just for pointers). So in which case is 1-based indexing convenient? Based on this blog post, never.

评论 #25848575 未加载

klik99over 4 years ago

The harder it is to learn, the more it takes you to change your mind. It's hard to see things without bringing baggageAlso, the irony of complaining about HN holy wars becoming just another trigger to continue! I'll do my part - whichever side you are on - you are wrong.

midjjiover 4 years ago

An interesting perspective is what this does to pointers in terms of unsigned representation, effectively making 0 a special case which could be used for null. But in practical terms it would just limit 32 bit systems to 31 bits of adressable memory, or make pointer arithmetics wierd. The latter isnt that much of a problem, as its only when pointers are basic types it matters, and free pointer arithmetics is a mistake whose only advantage is compiler speed, not even application speed.For instance, even in slicing, [start:end) eliminates this issue. Though there is something to be said for verbosity.I think this becomes an argument for 0 based indexing, but a case could be made to the opposite.That said, consistency is what is valuable, so the only truly wrong answer would be the one we use in real life.

eterevskyover 4 years ago

I don't understand why the author treats referring to Dijkstra's paper as "appealing to authority". To me it is natural to defer to a well-written paper instead of repeating the same arguments in your own words, regardless on who wrote it.

taneqover 4 years ago

We can't even agree on this for floors of buildings. Half the world starts at 'G' then 1, 2, 3 etc. as you go upwards, the other half starts at 1, then 2, 3 etc.And then you have Japan which sometimes starts at 2F. Hey, maybe we should use 2-based indexing.

评论 #25846588 未加载

je42over 4 years ago

sometimes i program in Lua (which is 1-based) but mainly i write code in 0-based languages.The worst is really when you switch between the languages, but after a while you get used to it.You just need to watch out like a hawk, when calculating the index of the last element ;)

donatjover 4 years ago

I grew up on languages that were 1 indexed - BASIC, VB, VB.Net. It took me a long time to get used to 0 indexed, primarily in college and later in working with more C inspired languages, yet I think 0 indexed is superior. Why?Numbers in programming start at 0, even in the languages with 1 indexing. Period. When I initialize an integer, unless I explicitly set a value, it’s 0.From a purely pragmatic standpoint, making the first number an invalid location in a list just adds unnecessary complexity and increases the odds of off-by-one errors. On the same token, there’s nearly no benefit to starting at one.

评论 #25843203 未加载

评论 #25844136 未加载

syntaxingover 4 years ago

The biggest thing about index 0 or 1 is the inclusive and exclusive slicing. When the index is 0 and the slicing is [inclusive:exclusive], I find it a bit easier to write code with array manipulation.

moleculeover 4 years ago

This was submitted yesterday [0], w/ discussion today.- [0] <a href="https://news.ycombinator.com/item?id=25829966" rel="nofollow">https://news.ycombinator.com/item?id=25829966</a>

mannykannotover 4 years ago

I am pretty sure that the last, counterfactual, paragraph of this article is spot-on.Personally, I am not so much interested in which way is more right, but which way is less likely to result in off-by-one and related errors, though I do not know how you could gather data on that, other for people beginning programming, and any difference might go away or even flip as programmers gain experience and move on to more difficult problems. (Of course, the most effective solution is to not use indexing at all, except where absolutely necessary.)

gabordemooijover 4 years ago

Citrine is also 1-indexed (<a href="https://citrine-lang.org/#lists" rel="nofollow">https://citrine-lang.org/#lists</a>), I think this would help people who are not developers to read and possibly verify code a bit easier. Of course it is just a small step, but I feel code in general is drifting away from the normal users, who seems to be confine to graphical interfaces. Wasn't it an objective once in the software development community to try and make code as accessible as possible?

teekertover 4 years ago

"... of course, nowadays the number one reason is tradition and familiarity given other popular languages, and I think even proponents of 0-based indexing would agree, in spite of the fact that most of them wouldn’t even notice that they don’t call it a number zero reason."That's it, I'm switching to saying number 0 reason from now on. If you are talking about your number 1 reason, I will forever be wondering what your number 0 reason would be. I'll tell my wife tonight she is my number 0!

DonaldFiskover 4 years ago

Pascal, contrary to what the article states, allows any subrange of integer as an array index. For example,<pre><code> kernel: array[-3..3,-3..3] of real; </code></pre> is a 7x7 array. It has negative indices, e.g.<pre><code> x := kernel[-3,-3]; </code></pre> See <a href="https://www.tutorialspoint.com/pascal/pascal_arrays.htm" rel="nofollow">https://www.tutorialspoint.com/pascal/pascal_arrays.htm</a>Sometimes you want negative indices, e.g. when performing a convolution on an image with a kernel.

评论 #25847801 未加载

GuB-42over 4 years ago

Maybe the confusion comes from the fact that programming languages don't make a distinction between cardinal and ordinal numbers.In English we do. For example, if you are nine years old, you are in your tenth year. If an array has ten elements, the last element is the tenth element, it is nine elements away from the beginning. We could do "int array[10]; array[10th] = x;" if we wanted to mirror that.We could imagine a language that does the distinction, but I don't know what good it would do.

nathellover 4 years ago

Wacław Sierpiński (1882–1969), the namesake of Sierpiński's triangle, was famously a proponent of zero-based indexing, including in everyday life. Sadly, I can only find sources in Polish that corroborate this [0].[0]: <a href="https://wyborcza.pl/AkcjeSpecjalne/7,160474,24501452,zaczynali-od-zera-stali-sie-legenda-jak-warszawscy-matematycy.html" rel="nofollow">https://wyborcza.pl/AkcjeSpecjalne/7,160474,24501452,zaczyna...</a>

madhadronover 4 years ago

I settled on 0 based indexing as the right way to go when I was doing this calculation: <a href="https://madhadron.com/posts/2009-07-17-determining-affine-transforms-from-three-points.html" rel="nofollow">https://madhadron.com/posts/2009-07-17-determining-affine-tr...</a>When you start calculating offsets and transformations of coordinates, starting at 1 forces you to carry around extra additive constants everywhere.

Decabytesover 4 years ago

As weird as I find 1 based indexing(looking at you R) I still get confused by 0 based indexing. This happens all the time when I’m working in pandas and thinking about line numbers in the file vs the data frame, and I recently had an issue with this while implementing Conway’s Game of Life in Racket, while I was traversing the array and when I was looking for neighbors.It has been enough of an issue for me to seriously reconsider my stance on indexing

_0w8tover 4 years ago

After some experience with Go I realized that in quite a few cases it will be nice to have 1-based indexes. If one have a variable that is an index into an array, then it is natural to initialize it to an invalid index like -1. But in Go everything is initialized by default to 0. So to get -1 one needs extra code.With one-based indexes zero would be a very natural invalid value as an index similar to a null pointer.

评论 #25848414 未加载

zzo38computerover 4 years ago

My own opinion is that zero based indexing is generally much more useful, and makes a lot of things things simpler and/or more sensible mathematically or otherwise. However, there are some cases where an index based on some other number (often, but not necessarily, 1) is more useful. (Some programming languages, such as BASIC, allow you to define whatever starting index that you want to do.)

enriqutoover 4 years ago

It does not seem like a big deal, both conventions are easy to understand and natural to use. Except if you work with the discrete Fourier transform. Then 1-based indexing is extremely cumbersome and annoying. More generally, if your indices are to be understood "modulo N", it makes sense that they are all strictly smaller than N (or even signed, and centered around 0).

metreoover 4 years ago

Unfortunately off-by-one errors can be insidious and non-obvious to debug, having different systems only complicates this. 0-indexing makes sense for systems and computer programming while 1-indexing makes translating math more straightforward. Inter-converting between the two systems is unfortunately non-trivial, even if the cost isn't that high.

waiseristyover 4 years ago

I think it just comes down to where you want your index-out-of-range errors to occur.Pretty much any language I can think of inits primitives with all bits set to 0. So either, you accidentally forget to set your index vars to 1 and out-of-range it there, or accidentally forget to take away 1 when accessing the end of the array. Pick your poison

评论 #25842934 未加载

评论 #25843140 未加载

dmcgover 4 years ago

This reminds me that because array access is just addition, in C you can also write 2[a] to access the third element of a.

hhyndmanover 4 years ago

The APL language has this system variable ⎕IO (called QUAD IO -- index origin), which could be set to 0 (for 0-based indexing) or 1 globally in the environment.Also, ⎕IO can be set as a local variable of a function to limit the scope of the indexing origin, which made it convenient to simplify the code to implement certain algorithms.

axismundiover 4 years ago

Maybe we should be counting from zero in general?<a href="https://lasvegassun.com/news/2016/nov/07/the-human-calculator-is-fixed-on-helping-young-peo/" rel="nofollow">https://lasvegassun.com/news/2016/nov/07/the-human-calculato...</a>

cycomanicover 4 years ago

The funny thing is that even in natural language we don't use base 1 or 0 consistently.For example why is the 1st of January the first day of the month/year, but your first birthday is when you turn one year old, i.e. when your first year of life finishes.

ubriewbadukover 4 years ago

I'd say from a mathematical perspective, 0-based indexing makes more sense, simply because the Peano axioms start with 0 for the natural numbers. Also, thinking of overflows makes more sense this way, because it's just modular arithmetic.

Hittonover 4 years ago

Until Perl v5.30 it was possible to set how you would index with "$[" variable.

trompover 4 years ago

The first element being at index 0 feels as natural as 0 being the first natural number.Or let me rephrase that in the interest of avoiding natural language bias.The starting element being at index 0 feels as natural as 0 being the starting natural number.

SeriousMover 4 years ago

Discussing about 0 vs 1 indexing is like questioning the key layout of a piano. Sure, one is probably easier to learn than the other but in the end its an interface everyone is used to.

snarfyover 4 years ago

Indexes start at 1. Offsets start at 0. They are different things.

zgsover 4 years ago

The crazy part is that Lua accepts "array[0]" just fine and it works exactly as expected. This is a non-issue in Lua but many people cite it as a problem.

ChrisSDover 4 years ago

Previous discussion: <a href="https://news.ycombinator.com/item?id=25829966" rel="nofollow">https://news.ycombinator.com/item?id=25829966</a>

albrewerover 4 years ago

I never thought I'd say this, but you've got to give props to VB where you can declare the starting (and ending) index of your array to be whatever you want!

daemonkover 4 years ago

This is something that's extremely annoying to deal with in a genomics context. Various formats in the field uses 0-based or 1-based and inclusive or exclusive.

TuringTestover 4 years ago

Mystery solved<pre><code> 0 1 2 3 4 5 v v v v v v |---|---|---|---|---|---... ^ ^ ^ ^ ^ ^ 1 2 3 4 5 6...</code></pre>

ltbarcly3over 4 years ago

I don't know if one is somehow 'essentially' better, but 0 based indexing have clearly won as a convention.

whalesaladover 4 years ago

This is a true hacker mindset. Love the fact that the author is questioning the groupthink here in a constructive way.

IlliOnatoover 4 years ago

While I don't have a strong preference in this "war", I can think of an interesting argument for 1-based.Our time counting is 1-based. There is no zeroth second, zeroth minute, hour, day, month, year, or century. This actually makes it slightly harder in 0-based languages to work with time and dates.Also, it might be an indication that 1-based system is somewhat more intuitive to non-programmers.

评论 #25844189 未加载

评论 #25844142 未加载

评论 #25843328 未加载

评论 #25844229 未加载

评论 #25844218 未加载

评论 #25844208 未加载

jpttsnover 4 years ago

A newborn is not one year old.Midnight is not 01:01 AM.

评论 #25846293 未加载

PaulHouleover 4 years ago

This is why in languages like pascal or Ada you can make an array that runs from 7 to 35.

nathiasover 4 years ago

Are there any languages with reversed indexing, or some other more bizzare options?

klyrsover 4 years ago

Python supports -1 based indexing, so clearly that's the superior convention.

评论 #25844020 未加载

评论 #25844236 未加载

jibbitover 4 years ago

Erlang is kinda interesting.. 1-based, but you might never need to know that.

pasquinelliover 4 years ago

i don't really care if the language has me start counting indices at 0 or 1. it's never been the part of the problem of programming that has caused me significant trouble.

cookie_monstaover 4 years ago

Counting in binary gets pretty boring if you have to start from 1.

raverbashingover 4 years ago

Well yes, Python or js don't have pointer arithmetic but underneath that's what happens if you're dealing with an arrayZero indexing might look weird at first but I think it helps in most cases.

评论 #25842916 未加载

评论 #25842917 未加载

评论 #25842879 未加载

xirbeosbwo1234over 4 years ago

I find zero-based indexing more natural. Consider the case of dates. Dates are naturally one-indexed. There is no year zero.This means that the 21st century started in the year 2001. Likewise, the current decade started under a month ago, not over a year ago. If you think the 21st century started in the year 2000, as do most people, then that implies that the first century either started in the year 1 B.C. (ugly) or was only 99 years long (not a century). People naturally (and incorrectly) default to zero-based indexing in this case!I would go farther and make centuries zero-indexed too. It's far more natural to say that the Nth century is everything in the form Nxx than to say it's (N-1)xx. Every time I run across a numbered century I have to stop and think for a split second to calculate what years it covers. I got this wrong on several occasions in grade school and was marked down for it.Zero-based indexing works naturally for grouping things. One-based indexing does not.This, of course, doesn't matter much.

85 comments

tragomaskhalosover 4 years ago

评论 #25846089 未加载

评论 #25845007 未加载

评论 #25844849 未加载

评论 #25846094 未加载

评论 #25846678 未加载

评论 #25844976 未加载

评论 #25845237 未加载

评论 #25847004 未加载

评论 #25853168 未加载

评论 #25845924 未加载

评论 #25852971 未加载

评论 #25847264 未加载

评论 #25849492 未加载

评论 #25855654 未加载

评论 #25846919 未加载

评论 #25846997 未加载

评论 #25847553 未加载

评论 #25846683 未加载

评论 #25845581 未加载

评论 #25845310 未加载

samatmanover 4 years ago

评论 #25844616 未加载

评论 #25844761 未加载

评论 #25844892 未加载

评论 #25845357 未加载

评论 #25844705 未加载

评论 #25845414 未加载

评论 #25848128 未加载

评论 #25846253 未加载

评论 #25847803 未加载

评论 #25845000 未加载

tzsover 4 years ago

评论 #25848313 未加载

评论 #25852228 未加载

评论 #25859535 未加载

ajucover 4 years ago

评论 #25845922 未加载

评论 #25845081 未加载

评论 #25844185 未加载

评论 #25844862 未加载

kstenerudover 4 years ago

评论 #25844715 未加载

评论 #25843799 未加载

评论 #25845950 未加载

评论 #25842957 未加载

评论 #25848115 未加载

评论 #25846166 未加载

chmod775over 4 years ago

评论 #25843031 未加载

评论 #25844310 未加载

评论 #25843320 未加载

评论 #25845874 未加载

评论 #25843968 未加载

评论 #25844328 未加载

评论 #25843397 未加载

GolDDranksover 4 years ago

> It really shows how conditioned an entire community can be when they find the statement “given a list x, the first item in x is x[1], the second item in x is x[2]” to be unnatural.It really shows how conditioned the mankind is that they call the "0th" ordinal number by the name "first". We have never moved past the phase where the concept of "zero" was heretic.Here + offset makes SO much sense. I think we should move to use it in other contexts too. "0 steps from the start of an ordered list" "1 step from the start of an ordered list" etc.At the moment we are using two different "scales" for measuring things and labeling orders. We could as well use the letters "A", "B", "C" for the latter, and it wouldn't change a thing. Hindsight is 20/20, but it's just a confusing mishap that we are using "numbers" and "numbers + 1" for the two purposes, where the second one could be expressed just as another "measurement", and we could get rid of the confusing "numbers + 1" scale.

评论 #25844947 未加载

评论 #25844883 未加载

评论 #25844919 未加载

评论 #25845913 未加载

评论 #25845476 未加载

评论 #25845742 未加载

DanielBMarkhamover 4 years ago

评论 #25845036 未加载

lifthrasiirover 4 years ago

评论 #25847330 未加载

nabla9over 4 years ago

评论 #25842900 未加载

评论 #25844213 未加载

评论 #25843753 未加载

评论 #25842873 未加载

评论 #25844376 未加载

tincholioover 4 years ago

评论 #25844375 未加载

评论 #25849750 未加载

bayindirhover 4 years ago

thayneover 4 years ago

评论 #25848757 未加载

grawprogover 4 years ago

评论 #25842969 未加载

recursiveover 4 years ago

Linking to Dijkstra does not have to be an appeal to authority. I happen to find his position to be persuasive on its own merits.

评论 #25843084 未加载

tobrover 4 years ago

评论 #25845149 未加载

ben509over 4 years ago

steerablesafeover 4 years ago

> It really shows how conditioned an entire community can be when they find the statement “given a list x, the first item in x is x[1], the second item in x is x[2]” to be unnatural.It's not unnatural, but it's not natural either. It's just that 1-based indexing aligns with natural language conventions better. It also matches with some math conventions for matrices and vectors, but those don't have to use 1-based indexing either to work. I learned linear algebra with abstract "index sets" (I), where I can be {1, 2, 3, .., n}, {x,y,z}, {0,1,2}, whatever.Programming in 0-based indexing means way less +-1 adjustments on boundaries. Division/modulus also works way better with 0-based indexing, so flattening multi-dimensional arrays is more easily expressed too.In the end it's just a convention, it shouldn't matter too much. My money is still on 0-based indexing, as it lends itself to less errors and cognitive load when you get used to it. I used both extensively.Tangential: conventions that are more "natural" are usually not actually natural, but just align themselves to other conventions better. Of course it's always good if conventions align, but sometimes it's inconsequential. Also there are situations where we can't make all conventions align.An other little pet-peeve of mine is people calling big-endian more natural than little-endian, where there are actually different competing conventions:1. Usual way of printing memory content from lower address to higher address from left to right (think hexdump).2. Usual way of writing down numbers from left to right from most significant digit to least significant digit.3. Expressing the value of the number from the byte array elements (\sum_0^(width-1) byte[n] 256^n for little-endian, \sum_0^(width-1) byte[n] 256^(width-n-1) for big-endian)Big-endian aligns better with the combination of 1 and 2. Little-endian aligns better with 3.

评论 #25844535 未加载

评论 #25845101 未加载

ginkoover 4 years ago

评论 #25845826 未加载

评论 #25845801 未加载

评论 #25845890 未加载

LandRover 4 years ago

评论 #25846135 未加载

dnauticsover 4 years ago

评论 #25850150 未加载

评论 #25851015 未加载

kevin_thibedeauover 4 years ago

With 1-based indexing, you can't index the last element of an array that occupies the entire domain of the index type.

评论 #25844312 未加载

评论 #25844262 未加载

评论 #25844520 未加载

jkingsberyover 4 years ago

stkdumpover 4 years ago

rufflezover 4 years ago

Why is this a thing?? Seriously, every language has its rules...just follow the rules and build something useful

评论 #25842997 未加载

tsegratisover 4 years ago

What convinced me is graphs' zero originWhen counting apples (i.e. only +) then 1 based. but when (* % ^ √ ÷) then 0 based

robertlagrantover 4 years ago

mhandleyover 4 years ago

评论 #25846757 未加载

Yizahiover 4 years ago

gabereiserover 4 years ago

mangecoeurover 4 years ago

评论 #25848383 未加载

sfgweilr4fover 4 years ago

numlock86over 4 years ago

评论 #25844972 未加载

epageover 4 years ago

psychoslaveover 4 years ago

oconnor663over 4 years ago

评论 #25845176 未加载

评论 #25842931 未加载

oiveyover 4 years ago

emn13over 4 years ago

评论 #25848575 未加载

klik99over 4 years ago

midjjiover 4 years ago

eterevskyover 4 years ago

taneqover 4 years ago

评论 #25846588 未加载

je42over 4 years ago

donatjover 4 years ago

评论 #25843203 未加载

评论 #25844136 未加载

syntaxingover 4 years ago

moleculeover 4 years ago

This was submitted yesterday [0], w/ discussion today.- [0] <a href="https://news.ycombinator.com/item?id=25829966" rel="nofollow">https://news.ycombinator.com/item?id=25829966</a>

mannykannotover 4 years ago

gabordemooijover 4 years ago

teekertover 4 years ago

DonaldFiskover 4 years ago

评论 #25847801 未加载

GuB-42over 4 years ago

nathellover 4 years ago

madhadronover 4 years ago

Decabytesover 4 years ago

_0w8tover 4 years ago

评论 #25848414 未加载

zzo38computerover 4 years ago

enriqutoover 4 years ago

metreoover 4 years ago

waiseristyover 4 years ago

评论 #25842934 未加载

评论 #25843140 未加载

dmcgover 4 years ago

This reminds me that because array access is just addition, in C you can also write 2[a] to access the third element of a.

hhyndmanover 4 years ago

axismundiover 4 years ago

cycomanicover 4 years ago

ubriewbadukover 4 years ago

Hittonover 4 years ago

Until Perl v5.30 it was possible to set how you would index with "$[" variable.

trompover 4 years ago

SeriousMover 4 years ago

Discussing about 0 vs 1 indexing is like questioning the key layout of a piano. Sure, one is probably easier to learn than the other but in the end its an interface everyone is used to.

snarfyover 4 years ago

Indexes start at 1. Offsets start at 0. They are different things.

zgsover 4 years ago

The crazy part is that Lua accepts "array[0]" just fine and it works exactly as expected. This is a non-issue in Lua but many people cite it as a problem.

ChrisSDover 4 years ago

Previous discussion: <a href="https://news.ycombinator.com/item?id=25829966" rel="nofollow">https://news.ycombinator.com/item?id=25829966</a>

albrewerover 4 years ago

I never thought I'd say this, but you've got to give props to VB where you can declare the starting (and ending) index of your array to be whatever you want!

daemonkover 4 years ago

This is something that's extremely annoying to deal with in a genomics context. Various formats in the field uses 0-based or 1-based and inclusive or exclusive.

TuringTestover 4 years ago

Mystery solved<pre><code> 0 1 2 3 4 5 v v v v v v |---|---|---|---|---|---... ^ ^ ^ ^ ^ ^ 1 2 3 4 5 6...</code></pre>

ltbarcly3over 4 years ago

I don't know if one is somehow 'essentially' better, but 0 based indexing have clearly won as a convention.

whalesaladover 4 years ago

This is a true hacker mindset. Love the fact that the author is questioning the groupthink here in a constructive way.

IlliOnatoover 4 years ago

评论 #25844189 未加载

评论 #25844142 未加载

评论 #25843328 未加载

评论 #25844229 未加载

评论 #25844218 未加载

评论 #25844208 未加载

jpttsnover 4 years ago

A newborn is not one year old.Midnight is not 01:01 AM.

评论 #25846293 未加载

PaulHouleover 4 years ago

This is why in languages like pascal or Ada you can make an array that runs from 7 to 35.

nathiasover 4 years ago

Are there any languages with reversed indexing, or some other more bizzare options?

klyrsover 4 years ago

Python supports -1 based indexing, so clearly that's the superior convention.

评论 #25844020 未加载

评论 #25844236 未加载

jibbitover 4 years ago

Erlang is kinda interesting.. 1-based, but you might never need to know that.

pasquinelliover 4 years ago

i don't really care if the language has me start counting indices at 0 or 1. it's never been the part of the problem of programming that has caused me significant trouble.

cookie_monstaover 4 years ago

Counting in binary gets pretty boring if you have to start from 1.

raverbashingover 4 years ago

Well yes, Python or js don't have pointer arithmetic but underneath that's what happens if you're dealing with an arrayZero indexing might look weird at first but I think it helps in most cases.

评论 #25842916 未加载

评论 #25842917 未加载

评论 #25842879 未加载

xirbeosbwo1234over 4 years ago