I found a few months ago that the GPT-4 code interpreter is capable of converting a black-and-white PNG of a glyph to an SVG<p><a href="https://twitter.com/lfegray/status/1678787763905126400" rel="nofollow noreferrer">https://twitter.com/lfegray/status/1678787763905126400</a><p>It would be cool to combine a script like the one GPT-4 gave me with an image generation model to generate fonts (a rough sketch of that kind of pipeline is below). The approach from this blog post is way more interesting, though.<p>On a separate note, it reminds me of this suckerpinch video :) maybe we can finally get uppestcase and lowestcase fonts<p><a href="https://www.youtube.com/watch?v=HLRdruqQfRk">https://www.youtube.com/watch?v=HLRdruqQfRk</a>
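A minimal sketch of that kind of bitmap-to-SVG pipeline, for what it's worth. This is not the exact script GPT-4 produced; it assumes Pillow and the potrace CLI are installed, and the file names are placeholders:<p><pre><code>
# Threshold the glyph bitmap to pure black/white, then trace the outline to SVG.
import subprocess
from PIL import Image

def png_glyph_to_svg(png_path: str, svg_path: str, threshold: int = 128) -> None:
    # Flatten to grayscale, binarize, and save as PBM, a format potrace accepts.
    img = Image.open(png_path).convert("L")
    bw = img.point(lambda p: 255 if p > threshold else 0).convert("1")
    pbm_path = png_path + ".pbm"
    bw.save(pbm_path)
    # Trace the bitmap into vector outlines.
    subprocess.run(["potrace", pbm_path, "--svg", "-o", svg_path], check=True)

png_glyph_to_svg("glyph_A.png", "glyph_A.svg")
</code></pre>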
Douglas Hofstadter, the author of Gödel, Escher, Bach, thought the task of creating fonts could only be solved with general AI.<p><a href="https://www.m-u-l-t-i-p-l-i-c-i-t-y.org/media/pdf/Metafont-Metamathematics-and-Metaphysics.pdf" rel="nofollow noreferrer">https://www.m-u-l-t-i-p-l-i-c-i-t-y.org/media/pdf/Metafont-M...</a><p>The Letter Spirit project aims to model artistic creativity by designing stylistically uniform "gridfonts" (typefaces limited to a grid).
I’ve tried some work on generating vector fonts too, representing glyphs as sequences of Bézier curves and using a seq2seq model. The problem was that the fonts output by ML models were imprecise: lines were not perfectly parallel, corners were at 89°, and curves were kinked. It’s not too difficult to get fonts that look good enough, but the imperfections are glaring because fonts are normally perfectly precise. These imperfections are evident in OP’s output too, and in my opinion they make these types of models unusable for actual typesetting.<p>A 1% error in a raster output would just be pixel colors being slightly off, but an 89° corner in a vector image is immediately noticeable, which makes this a hard problem to solve. I haven’t looked into this problem much since, but I’m interested to hear about possible solutions and reading material.
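One crude direction I've wondered about (not something I've validated) is regularizing the geometry after generation: snap nearly-horizontal or nearly-vertical segments to exactly horizontal/vertical, which also forces nearly-90° corners between them back to 90°. A sketch, with made-up point lists and tolerances:<p><pre><code>
import math

def snap_outline(points, angle_tol_deg=2.0):
    # points: a closed polyline as a list of (x, y) tuples.
    snapped = [list(p) for p in points]
    for i in range(len(points)):
        j = (i + 1) % len(points)
        (x0, y0), (x1, y1) = points[i], points[j]
        angle = math.degrees(math.atan2(y1 - y0, x1 - x0)) % 180
        if min(angle, 180 - angle) < angle_tol_deg:      # almost horizontal
            y = (y0 + y1) / 2
            snapped[i][1] = snapped[j][1] = y
        elif abs(angle - 90) < angle_tol_deg:            # almost vertical
            x = (x0 + x1) / 2
            snapped[i][0] = snapped[j][0] = x
    return [tuple(p) for p in snapped]

# A slightly-off rectangle becomes a true rectangle with 90° corners.
print(snap_outline([(0, 0), (100, 1), (101, 80), (-1, 79)]))
</code></pre>
Real outlines are Bézier curves rather than polylines, so an actual implementation would have to handle control points and on-curve points separately; this only illustrates the idea.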
I think this approach isn't ideal because you're representing pixels as 150x150 unique bins. With only 71k fonts, it's likely that a lot of these bins are never used, especially at the corners. Since you're quantizing anyway, you might as well use a convnet and then trace the output, which would better take advantage of the 2D nature of the pixel data.<p>This kind of reminds me of DALL-E 1, where the image is represented as 256 image tokens and then generated one token at a time. That approach is the most direct way to adapt a causal-LM architecture, but it clearly didn't make a lot of sense because images don't have a natural top-down, left-to-right order.<p>For vector graphics, the closest analogous concept to pixel-wise convolution would be the Minkowski sum. I wonder if a Minkowski sum-based diffusion model would work for SVG images.
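To spell out the first point as I read it: if every (x, y) position on a 150x150 grid gets its own token, the vocabulary has 22,500 entries, and any bin that no outline ever touches is a token the model never trains on. The grid size comes from my reading of the post; the exact tokenization may differ:<p><pre><code>
GRID = 150  # 150x150 coordinate bins -> 22,500 possible tokens

def point_to_token(x, y, width, height):
    # Quantize a coordinate onto the grid, then flatten (row, col) to one id.
    col = min(int(x / width * GRID), GRID - 1)
    row = min(int(y / height * GRID), GRID - 1)
    return row * GRID + col

def token_to_point(token, width, height):
    # Invert the mapping, returning the bin center.
    row, col = divmod(token, GRID)
    return ((col + 0.5) * width / GRID, (row + 0.5) * height / GRID)

print(GRID * GRID)                                  # 22500
print(point_to_token(310.0, 95.0, 1000.0, 1000.0))  # 2146
</code></pre>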
Heh, the machine learning naysayers are gonna jump on this one for sure.<p>Consider a human being designing a sci-fi styled font; how do they get started? By opening references, of course! To examples of other sci-fi styled fonts that they do not have the rights to, nor will they credit.<p>Now consider another human being designing a sci-fi styled font, but one who is not allowed to reference the work of anybody else, as some argue machine learning models ought to be. This person has no references to open; they have not seen any sci-fi media, be it movies or posters or fonts or anything else. How can they create something like this without any reference at all?<p>If a human being creates a sci-fi font, and their inspiration is not references to other sci-fi fonts but instead, I don't know, a general concept of the "vibe" they got from watching Blade Runner, must they credit Blade Runner for the inspiration? Must they pay the owner of the Blade Runner rights for their use of ideas from Blade Runner?
I've long had a project in mind involving the various typefaces of the signage around the city of Vienna, which I find very inspiring in many cases.<p>The idea is to just take a picture of every different typeface I can find, attached to the local buildings at street level.<p>There are some truly wonderful typefaces out there, on signage dating back to last century, and I find the aesthetics often quite appealing.<p>With this tool, could I take a collection of the various typefaces I've captured, and get it to complete the font, such that a sign that only has a few of the required characters could be 'completed' in the same style?<p>Because if so, I'm going to start taking more pictures of Vienna's wonderful types ..
Hmmm. The model is a ckpt instead of a safetensors file.<p>Pondering whether to proceed with trying this out or not...<p>EDIT: a scan with picklescan[0] found nothing... exciting.<p>[0] <a href="https://github.com/mmaitre314/picklescan">https://github.com/mmaitre314/picklescan</a>
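If the scan comes back clean, one option is to convert the ckpt to safetensors once so the pickle load never has to happen again. A sketch, not taken from the fontogen repo; the paths and the checkpoint layout (a plain state dict vs. a wrapper dict) are assumptions:<p><pre><code>
import torch
from safetensors.torch import save_file

# weights_only=True (PyTorch >= 1.13) restricts unpickling to tensors and
# primitives; it may fail if the checkpoint stores extra objects.
ckpt = torch.load("model.ckpt", map_location="cpu", weights_only=True)
state_dict = ckpt.get("state_dict", ckpt) if isinstance(ckpt, dict) else ckpt

# safetensors wants a flat name -> tensor mapping of contiguous tensors.
save_file({k: v.contiguous() for k, v in state_dict.items()}, "model.safetensors")
</code></pre>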
OK, that's cool, but those fonts are all terrible. The serifs are all different sizes and shapes, sometimes on the same letter. The kerning looks like a random walk. The stroke widths are all over the place, and/or the hinting is busted.<p>Now, that said, it's pretty amazing that this works at all, but it'll take some pretty specific training on a model to get something that can compete with a human-made font that's curated for good usability _and_ aesthetics.<p>Sadly, we'll also probably see adoption of these kinds of fonts (along with graphic design, illustration, songwriting, screenwriting, etc.)... because of "meh, good enough" combined with some Dunning-Kruger.<p>TL;DR: Thanks, I hate it.
Kinda funny how this works well, whereas diffusion models go to die when it comes to drawing text. But of course it works in a completely different manner.
Okay I can't try it out anyway. "Blocksparse is not available: the current GPU does not expose Tensor cores"<p>My "best" GPU is an RTX 2070 Super, Turing architecture.<p>I've seen similar messages when using stable-diffusion... either with -webui or with automatic, can't exactly remember, but they both run fine on that RTX 2070 Super, so I can only guess that they revert to some other method than Blocksparse on seeing that it doesn't support Turing. Or something. I haven't looked into how they deal with it.<p>I've submitted an Issue [0] for it. I don't have enough knowledge to know if there's some way of saying "don't use Blocksparse" for fontogen.<p>[0] <a href="https://github.com/SerCeMan/fontogen/issues/2">https://github.com/SerCeMan/fontogen/issues/2</a>
Although I would be sad to see the handcrafting that goes into designing custom fonts go, some iterations down the line a model like this would greatly aid the tedious glyph alignment and consistency tasks involved in designing fonts for CJK scripts such as hiragana, katakana, and kanji. Inspiring stuff.
Neat! Does it have prompt capabilities for things like FVAR, GSUB, and GPOS? E.g. "okay now include a many-to-one ligature that turns the word 'chicken' into an emoji of a chicken in the same style" or "now make a second, sans-serif, robotic style and add an axis called interpol that varies the font from the style we just made to this new style"?
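For the GSUB half of that, at least, it would probably be easier to let the model generate the glyphs and then bolt the ligature on afterwards with fontTools, since a many-to-one ligature is a one-liner in OpenType feature syntax. A sketch; the font path and the glyph name ('chicken.emoji', which would have to exist in the font) are made up:<p><pre><code>
from fontTools.ttLib import TTFont
from fontTools.feaLib.builder import addOpenTypeFeaturesFromString

FEATURES = """
feature liga {
    sub c h i c k e n by chicken.emoji;
} liga;
"""

font = TTFont("generated_font.ttf")
addOpenTypeFeaturesFromString(font, FEATURES)  # compiles the rule into GSUB
font.save("generated_font_liga.ttf")
</code></pre>
Variable-font axes (fvar plus interpolation masters) seem like a much harder ask for a generative model, since the instances along the axis have to stay point-compatible.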
This is interesting, but I think generating the next letter from the letters before may not be the best way to do it. As you mentioned, they degrade with each letter.<p>Maybe creating one long image of a whole font would work better.<p>edit: in the above I was misunderstanding what is happening here.<p>But I still think there must be another way to structure this so the attention mechanism doesn't have to work so hard.
Designing fonts for languages that use Chinese characters is often challenging due to the sheer number of glyphs.<p>This approach to generating fonts is very interesting… feels like it could unlock the creation of heavily stylized fonts that just wouldn’t be feasible otherwise.
In honor of all the times he pressed his hands into his eyes (and myself doing the same thing):<p>I present: “Perplexed” by Nisla. [0]<p>I have a print in my office, in lieu of a mirror.<p>[0] <a href="https://www.sargentsfineart.com/img/nisla/all/nisla-perplexed.jpg" rel="nofollow noreferrer">https://www.sargentsfineart.com/img/nisla/all/nisla-perplexe...</a>
"Fucking Hell" - first thing I yelled to myself when I saw that headline<p>Kudos for the project, of course, but it just saddens me a bit more. Nothing is sacred anymore.