When it comes to evaluating AI-generated code, functional accuracy has long been the gold standard. But what about user experience—the look, feel, and usability that make digital tools actually enjoyable? Tencent thinks it's time for a change, and it just introduced a new benchmark, ArtifactsBench, to fill