Show HN: I built a tool that watches webpages and exposes changes as RSS

(sitespy.app)

156 points | by vkuprin 8 hours ago

25 comments

tonyekh 1 minute ago
Nice work! RSS feed is quite handy. Email alert would be a nice option as well.
ahmedfromtunis 4 hours ago
As a (former) reporter, site monitoring is a big part of what I do on a daily basis and I used many, many such services.
I can attest that, at least from the landing page, this seems to be a very good execution of the concept, especially the text-based diffing to easily spot what changed and, most importantly, how.
The biggest hurdle for such apps however are 'js-based browser-rendered sites' or whatever they're called nowadays. How does Site Spy handle such abominations?
[-]
- vkuprin 3 hours ago
  Thanks, that’s a really good question. Site Spy uses a real browser flow, so it generally handles JS-rendered pages much better than simple HTML-only polling tools. In practice, the trickier cases tend to be sites with aggressive anti-bot protection or messy login/session flows rather than JS itself. I’m trying to make those limitations clearer so people don’t just hit a vague failure and feel let down
xnx 6 hours ago
I like https://github.com/dgtlmoon/changedetection.io for this. Open source and free to run locally or use their Saas service.
[-]
- raphman 5 hours ago
  There's also https://github.com/thp/urlwatch/ - (not aware of any SaaS offer - self-hosted it is).
  [-]
  - vkuprin 5 hours ago
    Yep, urlwatch is a good one too. This category clearly has a strong self-hosted tradition. With Site Spy, what I’m trying to make much easier is the browser-first flow: pick the exact part of a page visually, then follow changes through diffs, history, RSS, and alerts with very little setup
- vkuprin 5 hours ago
  Yep, changedetection.io is a good project. With Site Spy, I wanted to make the browser-first workflow much easier: install the extension, connect it to the dashboard, click the exact part of the page you care about, and then follow changes as diffs, history, or RSS with very little setup. I can definitely see why the open-source / self-hosted route is appealing too.
  [-]
  - xnx 4 hours ago
    Changedetection has an extension too: https://chromewebstore.google.com/detail/changedetectionio-w...
- beepbooptheory 5 hours ago
  Sure but this one has a MCP server, costs money, and was presumably made last night!
  [-]
  - nicbou 2 hours ago
    It's been around for a while and recommended by many. I tried it myself and it's okay.
- pelcg 5 hours ago
  Looks cool and this can be self hosted and it is for free.
  Nice will try this out!
pentagrama 35 minutes ago
That looks nice, I use the free plan of https://visualping.io for some software changelogs, RSS feeds are a paid feature. Will check this out.
[-]
- vkuprin 18 minutes ago
  Yeah, RSS is free on all plans — it felt like a core feature, not an upsell
tene80i 5 hours ago
RSS is a useful interface, but: "Do most people just want direct alerts?" Yes, of course. RSS is beloved but niche. Depends who your target audience is. I personally would want an email, because that's how I get alerts about other things. RSS to me is for long form reading, not notifications I must notice. The answer to any product question like this totally depends on your audience and their normal routines.
[-]
- ikari_pl 4 hours ago
  It's niche because some companies decided so.
  you used to have native RSS support in browsers, and latest articles automatically in your bookmarks bar.
  [-]
  - ctxc 4 hours ago
    That's good reasoning, but the parent's point still stands?
- Symbiote 1 hour ago
  I added my employer's website RSS feed to the all-staff Slack channel. I find it useful, I don't know about others but no one has grumbled.
  https://slack.com/intl/en-gb/help/articles/218688467-Add-RSS...
plutokras 2 hours ago
I have my own hobby RSS server built around the Google Reader API. Two of my plugins are pretty similar to what you described: one checks a page’s current state against the last saved version and publishes an entry if anything changed, the other is basically a CSS selector-based feed builder. Always good to see RSS content here, thanks for posting!
On your questions: some people prefer RSS, others email, and services exist to convert between the two in both directions. My own rule of thumb is email for things that need actual attention and RSS for everything that can wait. If you’re thinking about turning this into a service, supporting both would make sense since people are pretty split on this.
enoint 6 hours ago
Quick feedback:
1. RSS is just fine for updates. Given the importance of your visa use-case, were you thinking of push notifications?
2. Your competition does element-level tracking. Maybe they choose XPath?
[-]
- vkuprin 5 hours ago
  Yep, Site Spy already has push notifications, plus email and Telegram alerts. I see RSS as the open interface for people who want to plug updates into their own reader or workflow. For urgent things like visa slots or stock availability, direct alerts are definitely the main path.
  And yeah, element-level tracking isn't a brand new idea by itself. The thing I wanted to improve was making it easy to pick the exact part of a page you care about and then inspect the change via diffs, history, or RSS instead of just getting a generic "page changed" notification
iamflimflam1 4 hours ago
Something I was planning on building but never got round - if anyone wants to do it then feel free to use this idea.
Lots of companies really have no idea what javascript is being inserted into their websites - marketing teams add all sorts of crazy scripts that don't get vetted by anyone and are often loaded dynamically and can be changed without anyone knowing.
A service that monitors a site and flags up when the code changes - even better if it actually scans and flags up malicious code.
dogline 3 hours ago
With lots of people showing how Saas apps can be easily written these days, I'm not as interested in those articles, as people showing off new ideas of what I can do with these new found abilities. This is cool.
lkozloff 3 hours ago
Love this - I had a similar idea years ago, specifically for looking at long-text privacy policies and displaying the `diff`... but obviously never built it.
What you've done here is that and so much more. Congrats!
Hauk307 2 hours ago
This is cool. I'd use it to track when state wildlife agencies update their regulation pages — those change once a year with no announcement and I always miss it. Element-level tracking would be perfect for that vs watching the whole page. To answer your question: I'd want both RSS and direct alerts (email/push) depending on urgency.
m-hodges 27 minutes ago
Great for opponent monitoring on political campaigns. We made an in-house version of this on Biden ‘20.
layman51 4 hours ago
How might this tool work in terms of “archiving” a site? This is just something I was wondering given the recent change and controversy about archiving service sites on Wikipedia.
[-]
- vkuprin 3 hours ago
  Site Spy keeps snapshot history, so you can revisit older versions of a page and inspect how it changed over time, not just get the latest alert. I’d describe it more as monitoring with retained history than as a dedicated public archive, but deeper archival integrations are definitely something I’ve thought about
dev_at 4 hours ago
There's also AnyTracker (an app) that gives you this information as push notifications: https://anytracker.org/
[-]
- Knork-and-Fife 4 hours ago
  and also visualping.io which sends email alerts
OSaMaBiNLoGiN 1 hour ago
Tool looks useful. But how is it that toggling between light/dark mode results in a multi-second freeze..? Scrolling drops frames, confirmed with dev tools.
Tested on m1 pro 2021 laptop and recent higher-end (4080, 14700k, etc) desktop. Same on both.
The fuck?
[-]
- vkuprin 57 minutes ago
  Yeah, that was a real bug — CSS transitions on the body were blocking the thread during theme switches. I pushed a fix for it earlier today. Should be smooth now, but let me know if you still see it
butterlesstoast 3 hours ago
This is quite a lovely implementation. Congrats!
bananaflag 5 hours ago
Very good!
This is something that existed in the past and I used successfully, but services like this tend to disappear
[-]
- vkuprin 5 hours ago
  That’s a completely fair concern. Services in this category do need to earn trust over time. I built the backend to handle a fair amount of traffic, so I’m not too worried about growth on that side. My goal is definitely to keep this running for the long term, not treat it like a one-off project
hinkley 5 hours ago
Back in 2000 I worked for a company that was trying to turn something like this into the foundation for a search engine.
Essentially instead of having a bunch of search engines and AI spamming your site, the idea was that they would get a feed. You would essentially scan your own website.
As crawlers grew from an occasional visitor to an actual problem (an inordinate percent of all consumer traffic at the SaaS I worked for was bots rather than organic traffic, and would have been more without throttling) I keep wondering why we haven’t done this.
Google has already solved the problem of people lying about their content, because RSS feeds or user agent sniffing you can still provide false witness to your site’s content and purpose. But you’d only have to be scanned when there was something to see. And really you could play games with time delays on the feed to smear out bot traffic over the day if you wanted.
[-]
- deceptionatd 2 hours ago
  Well-designed sitemaps and use of something like https://www.indexnow.org/ helps.
  Cloudflare has Crawler Hints which works well IME: https://blog.cloudflare.com/crawler-hints-how-cloudflare-is-...
makepostai 6 hours ago
This is interesting, gonna try it on our next project! thumb up
nicbou 4 hours ago
Buddy I love you!
I have wanted this for so long! My job relies on following many German laws, bureaucracy pages and the like.
In the long run I want specific changes on external pages to trigger pull requests in my code (e.g. to update a tax threshold). This requires building blocks that don't exist, and that I can't find time to code and maintain myself.
I currently use Wachete, but since over a year, it triggers rate limits on a specific website and I just can't monitor German laws anymore. No tools seem to have a debounce feature, even though I only need to check for updates once per month.
[-]
- vkuprin 3 hours ago
  German laws and bureaucracy pages are exactly the kind of thing where tracking one specific part of a page is much more useful than watching the whole page. And yeah, more control over check frequency makes a lot of sense if monthly checks are enough and rate limits are the main problem. I’d be curious what kind of schedule would work best for you there?
  [-]
  - nicbou 2 hours ago
    Monthly is fine, but not monthly all at once, because I watch multiple pages on one website, and that triggers the rate limiting.
    The ideal pipeline for me would be "notice a change in a specific part of a page, use a very small LLM to extract a value or answer a question, update a constant in a file and make a pull request".
    I've been thinking about this pipeline for a long time because my work depends on it, but nothing like it seems to exist yet. I'll probably write my own, but I just can't find the time.
    [-]
    - vkuprin 51 minutes ago
      You can already work around the rate-limit issue today — there's a global minimum recheck interval in Settings that spreads checks out across time. Not per-site throttling yet, but it prevents one domain from getting hit too many times at once.
      The pipeline you described — detect a change, extract a value with a small LLM, open a PR — is pretty much exactly what the MCP server is designed for. Connect Site Spy to Claude or Cursor, and when a specific part of a page changes, the agent can handle the extraction and PR automatically. I don't think anyone has wired up that exact flow yet, but all the pieces exist.
digitalbase 6 hours ago
Cool stuff. You should make it OSS and ask a one time fee for it. I would run it on my own infra but pay you once(.com)
pwr1 6 hours ago
Interesting... added to bookmarks. Could come in handy in the future
breadcat 3 hours ago
i love a good rss tool. Thanks for sharing
docybo 3 hours ago
that's quiet good. will give a try congrat !