Show HN: StackScope – I crawled over 40k indie launches to see what they ship

stackscope.dev

60 points by datafreak_ 1 day ago

Hey all, I built StackScope, a crawler/catalogue that looks at new product launches and shows what they were built with.

It watches launches from Product Hunt, Show HN, and PeerPush, then crawls the public site behind each one. The goal is to show what people actually launched with: hosting, frameworks, analytics, DNS, security headers, legal pages, AI-builder signals, and other public clues.

I started building it because most stack-detection sites look at the web as a whole. I was more interested in the current indie launch scene: what people are choosing right now, at the point they first put something in public.

A few implementation details: it runs on .NET, uses Playwright for rendered pages, and has a first-party fingerprint catalogue rather than one copied from Wappalyzer/etc. robots.txt is honoured, and the bot identifies itself.

Frustratingly, I am still waiting for verified bot status from Cloudflare and currently that knocks out about 10% of all sites.

There is also a private readiness check: paste a URL, get the same style of report, fix things, and recrawl. No account or email needed.

I'd be interested in feedback on the usefulness of this, the methodology, and any obvious false positives.

Jonathan.

schamack 2 hours ago

That looks very impressive! And really insightful as well. And to my surprise, I even found my startup on the landing page :D (possibly due to fresh HN post here as well).

I will have a thorough look on the insights later, as there might either be some small false positives, or my projects still needs some grunt work :).

So from my perspective, being part of the check, I find it extremely useful.

  • datafreak_ 37 minutes ago

    Let me know if anything is detected wrong!

pixel_popping 1 day ago
  • datafreak_ 1 day ago

    What's your response header content? Reads like it's being cf-mitigated.

    Edit: loosened a cf rule, should be good now.

addedlovely 1 day ago

Nice one.

I've been doing the same bit wider scope, for the whole Crux list, pruned to apex domains, and looking for CMS signals - how's your throughput?

I'm not doing any headless browser stuff, or many requests, so hyper optimised for speed.

I do grab robots.txt - didn't really see much in llms.txt or humans.txt in the wild, does yours?

  • addedlovely 1 day ago

    Ohh Cloudflare verified bot status, interesting I'll check that out.

    I'm seeing about 6.6% block rate, but that does climb over time.

thetwopct 11 hours ago

Interesting but Laravel usage seems so low. As Laravel uses React or Vue etc I feel like a site could be easily classified as React when it’s Laravel running the show.

jrhizor 22 hours ago

This was interesting to look at for my own launch https://stackscope.dev/launch/xisjc21x/elmo

I'm a little surprised you can't see the analysis for StackScope itself!

Also, it'd be interesting to see the trend of vibe score over time.

  • datafreak_ 21 hours ago

    I'm trying but it's getting blocked by Cloudflare and ending up getting broken, ha!

    I will include a vibe score trend in the next blog post.

Gabriel_Valente 1 day ago

Nice one, looks interesting. I didn't knew that Vercel was so far ahead on hosting. People don't usually use Cloudflare Pages for hosting? Or it doesn't enter in that metric? https://stackscope.dev/trends/2026/06

Overall this looks solid

  • datafreak_ 1 day ago

    Cloudflare Pages get hidden behind the CF CDN, so are included in the CDN figure.

8organicbits 19 hours ago

Cool tool, I'm also surprised by how different the startup stacks are from the general Internet.

For HSTS, don't forget to check the preload list. Domains under .dev are all preloaded, for example, so they don't need to set the header for HSTS to apply.

idid 1 day ago

One extension, beyond stack: market category/domain/application - or any combo that tells me what the product does.

Fab project otherwise!

bji9jhff 16 hours ago

"Verified bot" for cloudflare sounds like "Acceptable ad" for adblock.

cpursley 22 hours ago

Makes me sad how Nextjs has become the default for so many startups…

  • sandeepkd 21 hours ago

    Unfortunately the chances are that it would become cyclic with the increasing use of LLMs to generate code