I made a real-time C/C++/Rust build visualizer

Night_Thastus · 2025-08-14T18:42:34 1755196954

I am extremely interested in this.

I am stuck in an environment with CMake, GCC and Unix Make (no clang, no ninja) and getting detailed information about WHY the build is taking so long is nearly impossible.

It's also a bit of a messy build with steps like copying a bunch of files from the source into the build folder. Multiple languages (C, C++, Fortran, Python), custom cmake steps, etc.

If this tool can handle that kind of mess, I'll be very interested to see what I can learn.

hagendaasalpine · 2025-08-15T07:50:46 1755244246

Tsoding wrote https://github.com/tsoding/nob.h, single header C library for cross platform builds, only requirement is cc. GDB profiling tools can then be used to look at your build step. It's a neat idea. I suspect this is not an option but Nix is great build tool if you are dealing with multiple languages.

jppittma · 2025-08-15T11:17:31 1755256651

Btw, he has a YouTube channel and streams. I recommend it if you’re seeking imposter syndrome.

wakawaka28 · 2025-08-15T00:15:44 1755216944

It's not "nearly impossible" but actually built in: https://cmake.org/cmake/help/latest/manual/cmake.1.html#cmdo... For the actual compile time you can easily insert a wrapper script. To be honest I haven't done that in over 4 years, but it has been done by many and it is easy.

There may be times when CMake itself is the bottleneck but it is almost certainly an issue with your dependencies and so on. CMake has many features to assist you in speeding up your compile and link time too. But it would take a series of blog posts to describe how you should try to speed it up.

Night_Thastus · 2025-08-15T16:58:57 1755277137

Does that just profile cmake's time to configure and generate, or the underlying compile of each file as well? Configuration and generation are only seconds when done from scratch - the build is more like 20 minutes.

Just trying to add that argument with 3.26.5 on Rocky Linux 9, I get 'Unknown argument --profiling-format=google-trace'.

Not sure why, as cmake --help clearly states it should be there...

  --profiling-format=<fmt>     = Output data for profiling CMake scripts.
                                 Supported formats: google-trace
  --profiling-output=<file>    = Select an output path for the profiling data
                                 enabled through --profiling-format.

tom_ · 2025-08-15T17:37:01 1755279421

You might need to specify --profiling-output= as well. I get an error fdrom cmake 3.31 if only the format is provided: CMake Error: --profiling-format specified but no --profiling-output!

Anyway, it looks like it only profiles the configure/generate steps. Not much use on Linux, but on Windows/macOS, perhaps. Due to lack of any standard package manager, it's a good idea to build every dependency from source on those OSs, and the time can mounts up.

My project is not that large, but it takes 1 minute to configure from scratch on Windows, and 10 minutes (!) on macOS.

Night_Thastus · 2025-08-15T17:59:34 1755280774

I specified both. In any case, yeah, cmake time isn't useful in my case.

Bizarre that you see 10 minutes on MacOS. Something's definitely busted there. It's not even that bad for me on Windows, and that's saying something.

tom_ · 2025-08-15T20:30:20 1755289820

For the record: the cmake profiling data did help a bit. On macOS it seems that the configure stage does 200+ try_compiles, largely due to SDL2, and each one somehow takes ~2.3 seconds. So that's 460 seconds right there. And regarding the total time, it's actually more like 480 seconds. (I must have misremembered! Or perhaps my laptop is measurably faster when its integrated GPU isn't driving 2 external displays.)

On Windows: 50 try_compiles, about half and half SDL2 and libuv, and each one takes ~1 second.

I don't think either of these try_compile turnaround times is acceptable (what is it doing?! I bet it's like 50 ms on Linux) but the total figure does now feel a bit less mysterious.

wakawaka28 · 2025-08-17T14:09:08 1755439748

You might also like my suggestions in this comment: https://news.ycombinator.com/item?id=44931551

wakawaka28 · 2025-08-17T13:53:18 1755438798

That does just do CMake time. Sorry if my comment was confusing. I forgot that I used a wrapper or something myself to collect the compile times. Anyway, my point is, you aren't the first person to have this issue. I've definitely profiled compile times before and viewed a flame graph with some gadget made by Google. The easiest way you to do it may depend on your compiler and build system. You could attack the problem from Ninja or Make, or by substituting your compiler binary for a wrapper script.

I also want to point out that if you want Ninja, it is a very minimal binary to build with no dependencies that wouldn't already be installed.

This is one of the simpler ways to find out what's going on: https://alangrow.com/blog/profiling-every-command-in-a-makef... Instead of replacing the shell you can also replace CC or CXX variables (if I recall properly) either in CMake or the Makefile to replace your compiler with a wrapper script that logs stuff.

More generalized Makefile profiling: https://github.com/konturio/make-profiler

Profiling with Clang: https://aras-p.info/blog/2019/01/16/time-trace-timeline-flam... (Maybe what I used last time...)

I am pretty sure you can get shiny outputs from a free profiling tool, but I think you can search for that yourself. It should be easy to find something. Good luck!

unddoch · 2025-08-14T20:14:22 1755202462

I wrote a little GCC plugin for compile time tracing/profiling, if that's something you're interested in: https://github.com/royjacobson/externis

Night_Thastus · 2025-08-15T16:35:44 1755275744

I just went ahead and tried it out :)

I can get it to work for some sub-sets of our project, but for quite a bit of it I get the following error:

cc1: error: cannot load plugin /opt/rh/gcc-toolset-13/root/usr/lib/gcc/x86_64-redhat-linux/13/plugin/externis.so: /opt/rh/gcc-toolset-13/root/usr/lib/gcc/x86_64-redhat-linux/13/plugin/externis.so: undefined symbol: _Z14decl_as_stringP9tree_nodei

I suspect this is because these are C or Fortran sub-projects. I'm looking for some clean way to tell Cmake to apply externis to all the C++ only subprojects if possible. I'll see what I can come up with.

I'd also like to know, if multiple GCC commands end up pointing to the same trace.json, especially in a parallel build, will externis automagically ensure that it doesn't step over itself?

Night_Thastus · 2025-08-15T18:28:28 1755282508

I figured out a way to set it at a top level, so it only happens with C++ files:

target_compile_options(${NAME} PUBLIC

$<$<COMPILE_LANGUAGE:CXX>:-fplugin=externis -fplugin-arg-externis-trace-dir=(where I want to put traces)>

)

But as I suspected, it is not a single trace file. It's thousands of trace files. Is there some way to collate all the data into one larger picture of how the build progressed?

phaedrus · 2025-08-14T21:00:20 1755205220

When I was trying to improve compile time for my game engine, I ended up using compiled size as a proxy measure. Although it is an imperfect correlation, the fact that compiled size is deterministic across build runs and even across builds on different machines makes it easier to work with than wall clock time.

mlsu · 2025-08-14T21:42:11 1755207731

Wait, this is not intuitive at all for me.

If the compiler is working harder wouldn't that result in a more compact binary? Maybe I'm thinking too much from an embedded software POV.

I suppose the compiler does eventually do IO but IO isn't really the constraint most of the time right?

staticfloat · 2025-08-14T21:49:22 1755208162

While you can cause the compiler to run longer to squeeze the binary size down, the compiler has a baseline number of compiler passes that it runs over the IR of the program being compiled. These compiler passes generally take time proportional to the input IR length, so a larger program takes longer to compile. Most compiler passes aren't throwing away huge amounts of instructions (dead code elimination being a notable exception, but the analysis to figure out which pieces of dead code can be eliminated still is operating on the input IR). So it's not a perfect proxy, but in general, if the output of your compiler is 2MB of code, it probably took longer to process all the input and spit out that 2MB than if the output of your compiler was 200KB.

johannes1234321 · 2025-08-14T21:50:22 1755208222

Of course there are the cases where a huge template structure with complex instantiation and constexpr code compiles down to a single constant, but for most parts of the code I would assume there is a proportion from code size, via compile time to binary size.

fransje26 · 2025-08-15T11:15:43 1755256543

> I am stuck in an environment with CMake, GCC and Unix Make (no clang, no ninja) and getting detailed information about WHY the build is taking so long is nearly impossible.

I have a similar problem, with a tangential question that I think about from time to time without really having the time to investigate it further, unfortunately.

I notice sometimes that CMake recompiles files that shouldn't have been affected by the code changes made previously. Like recompiling some independent objects after only slight changes to a .cpp file without any interface changes.

So I often wonder if CMake is not making some file more inter-dependent than what they are, leading to longer compilation times.

1718627440 · 2025-08-14T23:57:34 1755215854

Can you set CC=time gcc ?

pklausler · 2025-08-14T21:17:37 1755206257

strace might help, if you have it.

touisteur · 2025-08-16T17:21:03 1755364863

perf record -e "syscalls:,sched:" is my go-to there - or just apply it on whole-system, aasone does.

Mawr · 2025-08-14T23:11:27 1755213087

Suggestion to the blog author - put:

> Here it is recording the build of a macOS app:

> <gif>

At the top of the page, it should be right under the header.

You made a thing, so show the thing. You can waffle on about it later. Just show the thing.

dhooper · 2025-08-15T01:11:13 1755220273

Good suggestion. Updated.

hdjrudni · 2025-08-15T02:51:39 1755226299

Good job. This caught my eye. I don't think the visual even needs much of an explanation, I can see what it's doing.

entelechy · 2025-08-14T20:22:40 1755202960

Love it! We did something similar using strace/dtruss back in 2018 with https://buildinfer.loopperfect.com/ and were generating graphs (using eg. graphviz and perfetto.dev) and BUCK files on the back of that

Whilst we regrettably never came around to package it as a propper product, we found it immensly valuable in our consulting work, to pinpoint issues and aid the conversion to BUCK/Bazel. We used graphviz, https://perfetto.dev/ and couple other tools to visualise things

Recently we cicled back to this too but with a broader usecase in mind.

There are some inherent technical challanges with this approach & domain:

- syscall logs can get huge - especially when saved to disk. Our strace logs would get over 100GB for some projects (llvm was around ~50GB)

- some projects also use https and inter process communications and that needs ot be properly handled too. (We even had a customer that was retriving code from a firebird database via perl as part of the compilation step!)

- It's runtime analysis - you might need to repeat the analysis for each configuration.

flakes · 2025-08-15T07:01:45 1755241305

Curious, what were you using for doing syscall logging? LD_PRELOAD tricks, or ebpf filtering?

entelechy · 2025-08-16T12:43:31 1755348211

mostly strace and it's macos equivalent; Later we moved to ptrace and ebpf. LD_PRELOAD unfortunately doesn't work for statically linked libc. There are also kernel probes but didn't like that it required root permissions...

bgirard · 2025-08-14T18:11:47 1755195107

That's really cool. Fascinating to think about all the problems that get missed due to poor or missing visualizations like this.

I did a lot of work to improve the Mozilla build system a decade ago where I would have loved this tool. Wish they would have said what problem they found.

dhooper · 2025-08-14T18:28:19 1755196099

(OP here) Thanks!

My call with the Mozilla engineer was cut short, so we didn't have time to go into detail about what he found, I want to look into it myself.

bvisness · 2025-08-14T21:50:52 1755208252

Hello, I am the engineer in question. I am not actually super familiar with the details of the build system, but from when I saw, the main issues were:

- Lots of constant-time slowness at the beginning and end of the build

- Dubious parallelism, especially with unified builds

- Cargo being Cargo

Overall it mostly looks like a soup of `make` calls with no particular rhyme or reason. It's a far cry from the ninja example the OP showed in his post.

epage · 2025-08-15T11:42:18 1755258138

What `cargo being cargo` problems are you having?

tom_ · 2025-08-14T21:25:53 1755206753

If you use the Visual C++ compiler on Windows, vcperf is worth a look: https://github.com/microsoft/vcperf - comes with VS2022, or you can build from github.

I've used it with projects generated by UBT and CMake. I can't remember if it provides any info that'd let you assess the quality of build parallelism, but it does have some compiler front end info which is pretty straightforward to read. Particularly expensive headers (whether inherently so, or just because they're included a lot) are easy to find.

muststopmyths · 2025-08-15T13:52:49 1755265969

Also Incredibuild. The free version is probably good enough to visualize your build and see any bottlenecks.

bdash · 2025-08-15T02:38:22 1755225502

I've had success using https://github.com/nico/ninjatracing along with Clang's `-ftime-trace` to visualize the build performance of a C++ project using CMake. https://github.com/aras-p/ClangBuildAnalyzer helps further break down what the compiler is spending its time on.

boris · 2025-08-14T19:09:19 1755198559

> It also has 6 seconds of inactivity before starting any useful work. For comparison, ninja takes 0.4 seconds to start compiling the 2,468,083 line llvm project. Ninja is not a 100% fair comparison to other tools, because it benefits from some “baked in” build logic by the tool that created the ninja file, but I think it’s a reasonable “speed of light” performance benchmark for build systems.

This is an important observation that is often overlooked. What’s more, the changes to the information on which this “baked in” build logic is based is not tracked very precisely.

How close can we get to this “speed of light” without such “baking in”? I ran a little benchmark (not 100% accurate for various reasons but good enough as a general indication) which builds the same project (Xerces-C++) both with ninja as configured by CMake and with build2, which doesn’t require a separate step and does configuration management as part of the build (and with precise change tracking). Ninja builds this project from scratch in 3.23s while build2 builds it in 3.54s. If we omit some of the steps done by CMake (like generating config.h) by not cleaning the corresponding files, then the time goes down to 3.28s. For reference, the CMake step takes 4.83s. So a fully from-scratch CMake+ninja build actually takes 8s, which is what you would normally pay if you were using this project as a dependency.

remexre · 2025-08-14T19:25:50 1755199550

> What’s more, the changes to the information on which this “baked in” build logic is based is not tracked very precisely.

kbuild handles this on top of Make by having each target depend on a dummy file that gets updated when e.g. the CFLAGS change. It also treats Make a lot more like Ninja (e.g. avoiding putting the entire build graph into every Make process) -- I'd be interested to see how it compares.

torarnv · 2025-08-15T08:23:19 1755246199

Awesome!! Are you planning to open source this? I’ve been working on something similar and would love to join forces!

CyberDildonics · 2025-08-14T20:13:17 1755202397

I love the visualization, I think it's great information and will be very helpful to whoever uses it.

I would think about a different name. Often names are either meant to be funny or just unique nonsense but something short and elegantly descriptive (like BuildViz etc.) can go a long way to making it seem more legitimate and being more widely used.

dhooper · 2025-08-14T20:29:58 1755203398

Thanks CyberDildoNics!

1718627440 · 2025-08-15T14:58:15 1755269895

I'm interested to try, but don't get what that early access signup is about. Where is the email address going to show up? Can I use a temp mail?

dhooper · 2025-08-15T15:40:37 1755272437

I used the term "early access" when I should've used "private beta". The sign up is just for me to have an email addresses to send the private beta when the next update goes out, and I'll follow up for feedback. Nothing sinister is happening.

1718627440 · 2025-08-15T16:33:42 1755275622

> to send the private beta when the next update goes out

Meaning I only can try the software after sign up, or did I miss the obvious repo link?

What language is this project written in, what build system does it use itself?

I just don't feel comfortable pasting my email address into a Google site. I can use a temp mail, but I will loose access to it in a few minutes/hours, so I don't know if that would annoy you.

What kind of times do you expect in the form, serial or parallel build time? What kind of file do you want to be modified. When I modify main.c, basically nothing gets rebuild, when I modify the central header file, it will be like a total rebuild. Can you clarify that in the form?

zorgmonkey · 2025-08-15T21:44:19 1755294259

You can make an email you don't care about with protonmail, I recommend them cause they don't require you to enter an existing email address or a phone number when signing up.

hiccuphippo · 2025-08-14T20:31:32 1755203492

Name checks out.

aanet · 2025-08-14T18:48:54 1755197334

This is fabulous!!

Is there a version available for MacOS today?? I'd love to give it a whirl... For Rust, C++ / Swift and other stuff.

Thanks!

dhooper · 2025-08-14T18:52:11 1755197531

I'll be sending out the a macOS version to another wave of beta users after I fix an outstanding issue, if you sign up (at bottom of article) and mention this comment I can make sure you're in that wave.

aanet · 2025-08-14T19:01:52 1755198112

Thanks. Signed up

Night_Thastus · 2025-08-14T18:52:21 1755197541

It looks like it doesn't have a public release for any OS yet, but has a way to enter for early access.

saagarjha · 2025-08-15T04:45:21 1755233121

I've done something similar by running Instruments during the build, which not only tells me which processes are running when but also what they're doing. Unfortunately Instruments gets upset if your build takes a long time, and it doesn't really allow filtering by process tree, but it helped shipped several major wins for our builds when I was working on Twitter's iOS codebase. Alas trying to this these days will not work because Instrument's "All Processes" tracing has been broken for a while (FB14533747).

forrestthewoods · 2025-08-14T19:47:09 1755200829

This is great. I was skeptical from the title but the implementation is very clever. This could be a super super useful tool for the industry.

JackYoustra · 2025-08-15T01:49:40 1755222580

For anyone using xcode: there's a builtin button to show a visualization for a build (not realtime afaik) for it too.

audiofish · 2025-08-14T22:10:51 1755209451

Really cool tool, but perhaps not for the original use-case. I often find myself trying to figure out what call tree a large Bash script creates, and this looks like it visualises it well.

This would have been really useful 6 months ago, when I was trying to figure out what on earth some Jetson tools actually did to build and flash an OS image.

pjmlp · 2025-08-15T07:44:23 1755243863

Great piece of work.

Without trying to devalue it, note that VS and XCode have similar visualization tools.

xuhu · 2025-08-14T18:22:44 1755195764

Is there a tool that records the timestamp of each executed command during a build, and when you rebuild, it tells you how much time is left instead of "building obj 35 out of 1023" ?

Or (for cmake or ninja) use a CSV that says how long each object takes to build and use it to estimate how much is left ?

dhooper · 2025-08-14T18:33:47 1755196427

OP Here. Thats an interesting idea. What The Fork knows all the commands run, and every path they read/write, so I should be able to make it estimate build time just by looking at what files were touched.

tiddles · 2025-08-14T18:54:55 1755197695

Nice, I’ve been looking for something like this for a while.

I’ve noticed on my huge catkin cmake project that cmake is checking the existence of the same files hundreds of times too. Is there anything that can hook into fork() and provide a cached value after the first invocation?

lights0123 · 2025-08-14T19:08:33 1755198513

My tips for speeding up builds (from making this same project but with ebpf):

- switch to ninja to avoid that exact issue since CMake + Make spawns a subprocess for every directory (use the binary from PyPi for jobserver integration)

- catkin as in ROS? rm /opt/ros/noetic/etc/catkin/profile.d/99.roslisp.sh to remove 2 python spawns per package

ethan_smith · 2025-08-15T07:36:38 1755243398

You could try ccache with the CCACHE_SLOPPINESS=file_stat_matches option, or implement a filesystem-level caching proxy like CachingFS or FUSE-based solutions that intercept and cache those redundant stat() calls.

supportengineer · 2025-08-14T18:34:35 1755196475

Amazing! Great job!

What limits your tool to compiler/build tools, can it be used for any arbitrary process?

dhooper · 2025-08-14T18:39:45 1755196785

Thank you! Yeah it can be used for any type of program, but I haven't been able to think of anything besides compilation that creates enough processes to be interesting. I'm open to ideas!

DiddlyWinks · 2025-08-14T18:44:05 1755197045

Video encoding and 3-D rendering are a couple that come to mind; I'd think they'd launch quite a few.

This looks like a really cool tool!

shakna · 2025-08-15T07:06:39 1755241599

Just as a random example in the area, I had a project where I transformed every frame in a video, using a custom binary, before encoding them back to the video.

Hundreds of thousands of processes were normal.

jeffbee · 2025-08-14T20:05:28 1755201928

This seems like a good place to integrate a Bazel Build Event Protocol stream consumer.

MathMonkeyMan · 2025-08-14T20:11:42 1755202302

I was going to comment that "what the fork" might not work for a client/server build system like Bazel, but now I have something to google instead.

epage · 2025-08-14T23:42:30 1755214950

How does this compare to `cargo check --timings`?

It visualizes each crate's build, shows the dependencies between them, shows when the initial compilation is done that unblocks dependents, and soon will have link information.

tempodox · 2025-08-15T07:00:25 1755241225

I would be interested and even pay for it, but I am not a joiner. And I don't want Google to know any of my email addresses.

terabytest · 2025-08-15T13:26:15 1755264375

It looks really nice. I wonder if it’d be possible to break it down even further by somehow instrumenting the actual processes and including their execution flame graphs as part of the chart. That would expose a ton of extra information about the large gaps of “inactivity”.

proctorg76 · 2025-08-14T21:19:53 1755206393

the parallels between tech and manufacturing never cease to amaze, this looks so much like the machine monitoring / execution system we use in the car parts plant I want to ask if you've calculated the TEEP and OEE of your build farm

secondcoming · 2025-08-15T14:52:41 1755269561

This looks awesome.

I've used clang's -ftime-trace option in the past and that's also really good. It's a pity gcc has nothing similar.

rustystump · 2025-08-15T00:05:12 1755216312

10/10 this is very cool and the kind of hacking i come here for

corysama · 2025-08-14T18:34:24 1755196464

Looks like a general `fork()` visualizer to me. Which is great!

ItsHarper · 2025-08-15T14:53:01 1755269581

I like that the name reflects its broad usefulness

time4tea · 2025-08-15T00:32:25 1755217945

Then I hid it away.

This is an ad not a helpful announcement.

lsuresh · 2025-08-15T16:22:15 1755274935

Would love to have our team try this out (we have some ridiculous rust builds).

bitbasher · 2025-08-15T04:17:24 1755231444

This is cool, but for Rust you have `cargo build --timings` built in and has even more details.

mrlonglong · 2025-08-14T21:22:14 1755206534

Does it work with cmake?

ItsHarper · 2025-08-15T14:55:08 1755269708

A CMake build is used as an example in the blog post. It's literally just visualizing all spawned processes, so it will work for anything that spawns subprocesses to do the build.

MBCook · 2025-08-15T03:19:32 1755227972

There is an example cmake graph in the article.

mrlonglong · 2025-08-16T09:16:46 1755335806

Thanks.

emigre · 2025-08-15T11:28:25 1755257305

Really nice and interesting! Thanks!

Cloudef · 2025-08-14T21:21:58 1755206518

LLVM is taking its sweet time, brew coffee

brcmthrowaway · 2025-08-14T18:58:02 1755197882

What about OSes that dont use fork()?

dhooper · 2025-08-14T18:59:10 1755197950

I use whatever the equivalent is on that OS.

mgaunard · 2025-08-14T18:54:12 1755197652

The real solution is to eliminate build systems where you have to define your own targets.

Developers always get it wrong and do it badly.

kirito1337 · 2025-08-14T20:00:58 1755201658

That's interesting.

metalliqaz · 2025-08-14T21:50:59 1755208259

Isn't `wtf` already a fairly common command?

Surac · 2025-08-14T18:28:24 1755196104

but why? I have to admit it's a fun project

rvrb · 2025-08-14T18:49:46 1755197386

here, I'll copy the first paragraph of TFA for you:

> Many software projects take a long time to compile. Sometimes that’s just due to the sheer amount of code, like in the LLVM project. But often a build is slower than it should be for dumb, fixable reasons.

klik99 · 2025-08-14T19:06:58 1755198418

Nice! Leaving a comment to easily find this later, dont have anything to add except this looks cool