TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Rich Text, Poor Text (2013)

57 点作者 SerCe大约 1 个月前

8 条评论

II2II大约 1 个月前
You pretty much need to use markup (or control codes) for rich text. Take bold, italic, underline, strikeout: those four can, and are, used in nearly any combination. You would need one bit for each of them. You would need two bits to specify four levels of headings. If you don&#x27;t allow for that, you are back to using markup. You would also need one bit to specify proportional&#x2F;fixed width font, because that is a thing too. That remaining bit would have to be used for superscript, since superscripts are commonly used for footnotes and simple mathematical expressions.<p>Okay, you can now create passable rich text documents for a limited (though common) range of purposes with that 8&#x2F;24-bit breakdown that was suggested. But you may have noticed the author mentioned subscripts, which wasn&#x27;t in my list. Well, it turns out that subscript and superscript have a terribly limited range of applications if you are specifying them per character: x^2^2 would be visually identical to x^22, and x^a_b would look different from x_b^a (with both presentations being nonsensical). The use of subscripts and superscripts in any technical applications would be severely limited. You need a much richer markup language to be truly expressive. So there really isn&#x27;t much of a point in offering subscripts. Superscripts, sure, because they have a few non-technical uses.<p>Yet the reality is that people want a much richer set of formatting options. At a minimum, they want to select fonts and font sizes. Some of the formatting options have semantics. I know I crammed four levels of headings in those eight bits, but that only makes sense in headings. It doesn&#x27;t make sense to specify it per character. Then there are other common document elements, like tables. You can create decent tables using monospaced fonts, but that is limiting and would produce undesirable results in some cases (try displaying April 5^th sensibly, using a monospace font so that it won&#x27;t affect the width of the columns). On top of that, you are ditching the concept of styles because that implies some sort of markup.
评论 #43598597 未加载
评论 #43615120 未加载
tinthedev大约 1 个月前
Hah, I was about to criticise the text for far too lightly conflating markup and punctuation, just to see the afterword.<p>I actually do think the author has a point, in that must solutions today are inelegant, I also don&#x27;t think this is a problem which has a real elegant solution. Where to draw the line? Why not encode fonts into the standard too, if we&#x27;re doing bold? Etc.<p>I&#x27;m still mostly in favour of keeping everything markdown (in my own writing), however much it pollutes the &quot;purity&quot; of text.
评论 #43599944 未加载
hello_computer大约 1 个月前
This person is confused. He&#x27;s citing a Ted Nelson paper about separating these things into layers (content, structure, &amp; special effects), while personally advocating that we mash it all into unicode.<p><a href="https:&#x2F;&#x2F;www.xml.com&#x2F;pub&#x2F;a&#x2F;w3j&#x2F;s3.nelson.html" rel="nofollow">https:&#x2F;&#x2F;www.xml.com&#x2F;pub&#x2F;a&#x2F;w3j&#x2F;s3.nelson.html</a>
评论 #43597854 未加载
lewisjoe大约 1 个月前
A lack of universally recognized richtext format is really a problem. Why? practically any rich-text that needs to be rendered across platforms (web and mobile devices) are now being stored as html or markdown or app-dependent json.<p>HTML was never envisioned as a cross-platform richtext format and markdown lacks almost half of all formatting features. Specialzed json is even more evil because the content becomes unrenderable when the parent app goes out of existence.<p>op&#x27;s suggestion (accomodating formattings as unicode bytes) might not be optimal however I&#x27;m happy at least somebody thought of this as a problem to solve.
ht_th大约 1 个月前
The odd thing is, you can do quite some bold&#x2F;italics&#x2F;superscript in Unicode nowadays. Because, at least from the ASCII letter range, they have been used in symbolic ways in Mathematics, etc., and have been added to Unicode as symbols rather than bold variants of letters. For example:<p>, !<p>, !<p>ᴴᵉˡˡᵒ, ᵂᵒʳˡᵈ!<p>So, there&#x27;s almost no bold&#x2F;italic punctuation. And non-ASCII Unicode letters aren&#x27;t &quot;supported&quot; this way either. But you can get quite far with &quot;formatted&quot; ASCII letters in Unicode, if you&#x27;re so inclined.
评论 #43598989 未加载
AlienRobot大约 1 个月前
People are limited by their tools.<p>The author believes that plain text should encode bold, italic, etc., because that&#x27;s all they had exposure to. Were the text written today, they would claim emojis belong in unicode as well.<p>Most social media don&#x27;t support it, but on Tumblr, for example, you can specify the color of the text and even choose a different font. I think there was some other social media that allowed you to have animated effects on the text as well, but I forgot the name.
评论 #43596656 未加载
timeflex大约 1 个月前
Sad what things like Markdown has done to people. It&#x27;s like they forgot about all the amazing semantic markup of HTML 5 to create strong relations between their data. I&#x27;ll take a Lexical editor with SQLite to store my data any day.
评论 #43601413 未加载
评论 #43597326 未加载
评论 #43598417 未加载
keepamovin大约 1 个月前
I like the idea of keeping the presentation out of the content, but keeping it in the character encoding. It&#x27;s a cool idea. Never thought of it before reading this.
评论 #43604475 未加载