In my experience, a culture where teammates prioritise review turnaround (both by checking for updates on GH a few times a day and by aggressively splitting changes into smaller patches) translates into much faster overall progress. It's definitely a culture thing: there's nothing technically or organisationally difficult about implementing it, it just requires people to treat team velocity as more important than personal velocity.
Let's say a teammate is writing code to do geometric projection of streets and roads onto live video. Another teammate is writing code to do automated drone pursuit of cars. And let's say I'm over here writing auth code, making sure I'm modeling all the branches that might occur, in whatever order.
To what degree do we expect intellectual peerage from someone just glancing at this problem because of a PR? To be a proper intellectual peer of someone who has actually been studying the problem, you'd basically have to double your effort.
If the team is that small and working on things that are that disparate, then it is also very vulnerable to one of those people leaving, at which point there's a whole part of the project that nobody on the team has a good understanding of.
Having somebody else devote enough time to being up to speed enough to do code review on an area is also an investment in resilience so the team isn't suddenly in huge difficulty if the lone expert in that area leaves. It's still a problem, but at least you have one other person who's been looking at the code and talking about it with the now-departed expert, instead of nobody.
This is unusually low overlap per topic; it probably needs a different structure than traditional PRs to get the best chance of benefiting from more eyes: higher-scope planning, or something like longer but intermittent pair programming.
Generally, if the reviewer is not familiar with the content, asynchronous line-by-line reviews are of limited value.
I'm surprised that the `isinstance()` comparison is with `type() == type` and not `type() is type`, which I would expect to be faster, since the `==` implementation tends to have an `isinstance` call anyway.
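The gap is real: `==` on types dispatches through `__eq__`, which for a custom metaclass can run arbitrary Python code, while `is` is a single pointer comparison. A toy metaclass (all names below are made up for illustration) makes the extra dispatch visible:

```python
class CountingMeta(type):
    eq_calls = 0

    def __eq__(cls, other):
        # Counts every time `==` is used on a class with this metaclass.
        CountingMeta.eq_calls += 1
        return cls is other

    def __hash__(cls):
        # Defining __eq__ suppresses the inherited hash, so restore one.
        return id(cls)

class Widget(metaclass=CountingMeta):
    pass

w = Widget()
assert type(w) == Widget            # dispatches into CountingMeta.__eq__
assert CountingMeta.eq_calls == 1
assert type(w) is Widget            # plain identity check, no dispatch
assert CountingMeta.eq_calls == 1   # counter unchanged
```

Even without a custom metaclass, `==` still goes through the rich-comparison machinery, so `is` is the idiomatic (and faster) way to compare exact types.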
We've been relying on TypeForm (an experimental feature in Pyright) in xDSL. Since there are some Astral members commenting here: are there any plans to support TypeForm any time soon? It seems like you already have some features that go beyond the Python type spec, so I feel like there may be hope.
Yes, we love TypeForm! We plan to support it as soon as the PEP for it lands. Under the covers, we already support much of what's needed, and use it for some of our special-cased functions like `ty_extensions.is_equivalent_to` [1,2]. TypeForm proper has been lower on the priority list mostly because we have a large enough backlog as it is, and that lets us wait to make sure there aren't any last-minute changes to the syntax.
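For anyone who hasn't followed the proposal: TypeForm lets you annotate a parameter as accepting any type *expression*, including ones like `Union[int, str]` that a `type[...]` annotation rejects. A rough sketch of the kind of function it's for, with `TypeForm` spelled as a string annotation since it isn't in the stdlib `typing` module (a checker that supports the feature would validate calls like `describe(Union[int, str])`):

```python
from typing import Union, get_args, get_origin

def describe(tp: "TypeForm") -> str:
    # A TypeForm parameter accepts arbitrary type expressions,
    # not just runtime classes the way a `type[...]` parameter would.
    if get_origin(tp) is Union:
        return " | ".join(describe(arg) for arg in get_args(tp))
    return getattr(tp, "__name__", repr(tp))

assert describe(int) == "int"
assert describe(Union[int, str]) == "int | str"
```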
In MLIR, there are two representations of memory, `tensor` and `memref`; the tensor form enables you to do some high-level things[0] in SSA before "bufferizing" to memrefs, which are eventually lowered to LLVM pointers.
Have you worked with TypeScript? I work with both every day and I'm always frustrated by the limits of the "type" system in Python. Sure, it's better than nothing, but it's so basic compared to what you can do in TypeScript. It's very easy to use advanced generics in TypeScript but hell (sometimes outright impossible) in Python.
Yep, although never in a project of a similar size. One advantage of the Python setup is that the types are ignored at runtime, so there's no overhead at startup/compilation time. Although it's also a disadvantage in terms of what you can do in the system, of course.
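Concretely, Python annotations are just metadata attached to the function object; the interpreter never resolves or enforces them when the code runs. In this sketch, `Vec3` and `Vec2` are hypothetical names that are deliberately defined nowhere:

```python
def project(v: "Vec3") -> "Vec2":
    # "Vec3" and "Vec2" don't exist anywhere, and the interpreter doesn't
    # care: only a static checker would ever try to resolve these names.
    return (v[0] / v[2], v[1] / v[2])

# Runs fine despite the undefined annotation names.
assert project((4.0, 2.0, 2.0)) == (2.0, 1.0)

# The annotations are stored as plain strings on the function object.
assert project.__annotations__ == {"v": "Vec3", "return": "Vec2"}
```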
I agree it is pretty nice (with uv and as long as you REALLY don't care about performance). But even if you are one of the enlightened few to use that setup, you still have to deal with dependencies that don't have type annotations, or only basic ones like `dict`.
TypeScript (via Deno) is still a better option IMO.
Can someone familiar with performance of LLMs please tell me how important this is to the overall perf? I'm interested in looking into optimizing tokenizers, and have not yet run the measurements. I would have assumed that the cost is generally dominated by matmuls but am encouraged by the reception of this post in the comments.
Tokenizing text is a ridiculously small part of the overall computation that goes into serving a request. With that said, if you're doing this on petabytes of data, it never hurts to have something faster.
To echo the other replies, the tokenizer is definitely not the bottleneck. It just happens to be the first step in inference, so it's what I did first.
Tokenization performance is complicated, but your guidepost is this: the institutions with the resources and talent to do otherwise still choose to write extremely fast tokenizers. sentencepiece and tiktoken both pay dearly in complexity, particularly complexity of deployment, because now you've got another axis of architecture-specific build/bundle/dylib to manage on top of whatever your accelerator burden already was: it's now aarch64 cross x86_64 cross CUDA capability...
Sometimes it can overlap with an accelerator issue, but pros look at flame graphs: a CPU core can be running the AVX lanes hard and still not keep the bus fed; it's a million things. People pre-tokenize big runs all the time.
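Pre-tokenizing as a separate pass is easy to sketch: run the tokenizer once over the corpus on CPU, in parallel, and keep only the token ids so the accelerator job never touches raw text. The toy whitespace tokenizer below is a stand-in for a real one like tiktoken or sentencepiece:

```python
from concurrent.futures import ThreadPoolExecutor

VOCAB = {"the": 0, "cat": 1, "sat": 2}
UNK = len(VOCAB)  # single id for out-of-vocabulary words

def tokenize(line: str) -> list[int]:
    # Stand-in for a real tokenizer; in practice this is where CPU time goes.
    return [VOCAB.get(word, UNK) for word in line.split()]

def pretokenize(corpus: list[str]) -> list[list[int]]:
    # One parallel CPU pass; the id lists can then be written to disk so the
    # training/inference run consumes ints only, never text.
    with ThreadPoolExecutor(max_workers=4) as pool:
        return list(pool.map(tokenize, corpus))

ids = pretokenize(["the cat sat", "the dog sat"])
assert ids == [[0, 1, 2], [0, 3, 2]]  # "dog" is out of vocabulary
```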
I don't know why this thread is full of "nothing to see here": this obliterates the SOTA from the money-is-no-object status quo. I'd like to think better of the community than the obvious explanation, which is that C++ is threatening a modest mindshare comeback against a Rust narrative already under pressure from the explosion of interest in Zig. Maybe there's a better reason.
I really want to switch to Zed from Cursor but the battery usage for my Python project with Pyright is unjustifiable. There are issues for this on GitHub and I'm just sad that the team isn't prioritising this more.
It’s funny you mention this because I have an issue with Cursor where if I leave it open for more than a few hours my M3 Max starts to melt, the fans spin up, and it turns out to be using most of my CPU when it should be idling.
Zed on the other hand works perfectly fine for me. Goes to show how a slightly different stack can completely change one’s experiences with a tool.
I love that language and frequently show it to people. I'm sad to see that my local install doesn't work any more. I actually used it to solve a puzzle in Evoland 2 that I'm relatively sure was added as a joke, and is not solvable in a reasonable time without a solver. I'm doing a PhD in compilers right now, and would love to chat about Sentient if you have the time. My email is sasha@lopoukhine.com.
You might be interested in looking at MiniZinc (https://minizinc.org/), an open-source modelling language for combinatorial problems. The system comes from a constraint programming background, but the language is solver-agnostic and can be compiled to many different types of solvers.