9 comments

  • worble 1 hour ago
    I'd be curious in how well it passes 100th Coin's NES accuracy tests https://github.com/100thCoin/AccuracyCoin
    • utopiah 1 hour ago
      Indeed, that's what I kind of hinted at in https://news.ycombinator.com/item?id=46442195 and coincidentally https://news.ycombinator.com/item?id=46437688 briefly after, namely that OK, one can "generate" a "solution", that's much easier than before... but until we can verify somehow that it actually does what it say it does (and we know of hallucinations and have no reason to believe this changed) then testing itself, especially of well know "problems" is more and more important.

      That being said, it doesn't answer the "why" in the first place, an even more important question. At least though it does help somehow to compare with existing alternatives.

      • garciasn 20 minutes ago
        Isn’t this how all software development works? Folks commit code, it’s tested, and reviewed, and then deployed.

        Why would this be any different?

        • PaulDavisThe1st 1 minute ago
          That's not how software development works.

          Folks think, they write code, they do their own localized evaluation and testing, then they commit and then the rest of the (down|up)stream process begins.

          LLM's skip over the "actually verify that the code I just wrote does what I intended it to" step. Granted, most humans don't do this step as thoroughly and carefully as would be desirable (sometimes through laziness, sometimes because of a belief in (down|up)stream testing processes). But LLM's don't do it at all.

    • roger_ 49 minutes ago
      I’m sure you can point Claude at that page and have it make the necessary changes to pass.
      • deadbabe 8 minutes ago
        Or it could loop infinitely, never quite being able to pass all the tests.
  • zorked 1 hour ago
    Nice, but NES emulator is one of the most written pet projects anywhere, which makes it considerably less impressive.
    • StilesCrisis 1 hour ago
      Heck, when Satya Nadella wanted to demonstrate Copilot coding, he had it emit an Altair emulator. I guess there's little room for creativity in 8-bit emulator design so LLMs can handle them well. https://thenewstack.io/from-basic-to-vibes-microsofts-50-yea...
    • noident 1 hour ago
      Somewhere along the line the AI bros stopped separating training and testing sets. It's great for impressing the villagers
  • keyle 1 hour ago
    Who care what it did. What did you learn? To live is to learn.
    • mikkupikku 1 hour ago
      When I consider the utility of a hammer, my first priority is to ask what the hammer can teach me.
      • NoraCodes 23 minutes ago
        Do you think that the use of a hammer is an innate skill, and that woodworkers learn nothing from their craft?
      • pygy_ 1 hour ago
        There are NES emulators aplenty, the only value in writing a new one is pedagogic, for the writer.

        This endeavor had negative net value.

        • mikkupikku 26 minutes ago
          How about being entertained by the process?
          • worthless-trash 10 minutes ago
            They didnt call it the "Nintendo Entertainment System" for nothing.
      • jancsika 10 minutes ago
        If it's a zillion dollar hammerbot the company is offering to your boss for pennies, that had better be your first priority!
      • risyachka 32 minutes ago
        Do you like to read posts about what hammer can do? Especially when it has been done 100 times already.
    • password54321 29 minutes ago
      Yeah I think this is the wrong approach. If they were making money out of it, that would be different. But this is pointless.
    • jgbuddy 1 hour ago
      to live is to build
      • shriek 10 minutes ago
        to build what you don't understand is to suffer in future
      • krapp 1 hour ago
        Except OP isn't learning or building. He's telling a computer to do the work for him and padding his resume.
        • danielbln 4 minutes ago
          How cynical. Just seeing if the current crop of automation systems can do it can be interesting enough for some of us.
  • deadbabe 8 minutes ago
    I will be impressed when new game consoles come to market and it can write the first emulator for it.
  • swannodette 41 minutes ago
    WASM and the performance seems catastrophically bad (45ms to render a frame on an M4 laptop)? It would be much more impressive if Claude could optimize it into something that someone would actually want to play? Compare this to a random hit from Google, https://jsnes.org/ which has sound, much smaller payload, and runs really fast (<1ms to render a frame).

    The cost of slop is >40X drop in performance? Pick any metric that you care about for your domain perhaps that's what you're going to lose and is the effort to recover that practical with current vibe-coding strategies?

  • delduca 2 hours ago
    • johnisgood 1 hour ago
      Why not use the LLM for more meaningful commit titles & messages as well while you are at it?
    • giancarlostoro 1 hour ago
      Surprised there's no README file at all.
  • cebert 1 hour ago
    It’s a shame that the source code isn’t commented and documented more. At the very least, I would see it being helpful to add some documentation for every CPU op code being emulated.
    • 112233 1 hour ago
      Forbidding LLM to write comments and docstrings (preferrably enforced by build and commit hook) is one of the best "hacks" for using that thing. LLM cannot help itself but emit poisonous comments.
      • jacquesm 13 minutes ago
        Or maybe clone the comments from where it cloned the source.
    • StilesCrisis 1 hour ago
      Probably better to look at a human-authored emulator if you want comments containing accurate information anyway.
    • bugfix 51 minutes ago
      If you let it, Claude Code will write a comment for almost every single line of code.
      • mikepurvis 41 minutes ago

            # Assign value of x to y
            y = x
  • Y_Y 1 hour ago
    Git wrote a functional NES emulator for me by simply cloning one of the many publicly available ones!
    • LunicLynx 1 hour ago
      This is the comment.

      Give it copy paste / translate tasks and it’s a no brainer (quite literally)

      But same can be said of humans.

      The question here is, did it implement it because it read the available online documentation about the NES architecture OR did it just see one too many of such implementations.

      • jacquesm 11 minutes ago
        > But same can be said of humans.

        Indeed, the 'cleanroom' standard always was one team does the RE and writes a spec, another team that has never seen the original (and has written statements with penalty clauses to prove it) then does the re-implementation. If you were to read the implementation, write the spec and then write the re-implementation that would be definitely violating the standard for claiming an original work.

    • draw_down 1 hour ago
      [dead]
  • cgfjtynzdrfht 1 hour ago
    Trained on 1000s of NES emulators, it's not really impressive.

    Github alone has +4k NES emulator projects: https://github.com/search?q=nes%20emulator&type=repositories

    This is more like "wow, it can quote training data".