As one example, I tried using Claude Opus 4.6 to generate a program that would interpret a custom DSL I use for typesetting grammars and generate Haskell type definitions from it. After 8 hours of prompting and several million tokens, the code it generated was still absolutely useless. It passed the tests I had prompted it on, but just by reading the code, one could easily identify type errors and logic that special-cased specific identifiers from the tests. The logic for sanitizing identifiers was a mess and would occasionally generate empty strings. A correct implementation would take me 300 to 400 lines of code, which I can certainly write in less than 8 hours.
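To make the identifier problem concrete, here is a minimal sketch of a sanitizer that cannot return an empty string. It is not the code from that session, and it is not my DSL's actual naming rule: the function name `sanitize_type_name`, the `Unnamed` fallback, and the `N` prefix for names starting with a digit are all assumptions chosen for illustration. The only fact it relies on is that a Haskell type or constructor name must start with an uppercase letter.

```python
import re


def sanitize_type_name(raw: str, fallback: str = "Unnamed") -> str:
    """Turn an arbitrary DSL name into a usable Haskell type name.

    Hypothetical helper: the fallback and the digit prefix are
    illustrative choices, not rules taken from any real tool.
    """
    # Split on every run of characters that is not an ASCII letter or digit.
    words = [w for w in re.split(r"[^A-Za-z0-9]+", raw) if w]

    # Upper-case the first character of each word and keep the rest as written.
    name = "".join(w[0].upper() + w[1:] for w in words)

    # Nothing usable survived (empty or punctuation-only input):
    # return the fallback instead of an empty string.
    if not name:
        return fallback

    # A Haskell type name may not start with a digit, so prefix one that does.
    if name[0].isdigit():
        name = "N" + name

    return name


print(sanitize_type_name("expr-list"))
print(sanitize_type_name("---"))
```

Note that this sketch only guarantees a non-empty, well-formed name. Distinct inputs such as `expr-list` and `expr_list` still map to the same output, so a real generator would also need a uniqueness pass over the whole grammar.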
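A couple of years ago, a heavy rainstorm caused leaks in the homes of the top-floor (fourth-floor) residents of Building 9, and the water then seeped down into the homes of residents on the third floor. "The old building was built with precast slabs, and over time they start to leak. There is no elevator, and now that we are older, going up and down the stairs is very inconvenient," said Zhou Wei.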