title: "Claude Fable 5, μ체 λ²€μΉλ§ν¬ 95% vs μ€μ 보μ μ·¨μ½μ μμ 19%" description: "λ²€μΉλ§ν¬ - μλ¬Έ κΈ°λ° μμ½ νμ" date: 2026-06-12 tags: [ai-news] source: "https://dev.to/crescevo/claude-fable-5-scores-95-on-its-own-benchmark-and-19-on-real-security-work-the-gap-is-the-lesson-3j7" sidebar: order: 0
μ λͺ©(νκΈ): Claude Fable 5, μ체 λ²€μΉλ§ν¬ 95% vs μ€μ 보μ μ·¨μ½μ μμ 19% μλ¬Έ μ λͺ©(μλ¬Έ): Claude Fable 5 Scores 95% on Its Own Benchmark and 19% on Real Security Work. The Gap Is the Lesson. μλ¬Έ: Claude Fable 5 Scores 95% on Its Own Benchmark and 19% on Real Security Work. The Gap Is the Lesson. μμ€: dev-to-ai MD νμΌ: content/2026-06-12/dev-to-ai-claude-fable-5-scores-95-on-its-own-benchmark-and-.md
ν΅μ¬ λ΄μ©
Anthropicμ΄ Claude Fable 5μ μ½λ© λ²€μΉλ§ν¬ μ μλ₯Ό λ°ννμ΄μ. SWE-bench Verified μ½ 95%, SWE-bench Pro 80.3%λ‘ 2μ λͺ¨λΈλ³΄λ€ 11μ μμλ μμΉκ±°λ μ.
κ·Έλ°λ° 보μ μ λ¬Έ νκ°μ¬ Endor Labsκ° λ 립 ν μ€νΈλ₯Ό λλ Έλλ μκΈ°κ° λ¬λΌμ‘μ΄μ. μ€μ μ½λμ μ·¨μ½μ μ κ³ μΉλ©΄μ κΈ°λ₯λ μ μ§ν΄μΌ νλ μνμμ κΈ°λ₯ ν΅κ³Όμ¨ 59.8%, 보μ μ·¨μ½μ μ€μ μμ λ₯ μ 19.0%λ‘ μ€μκΆμ λ¨Έλ¬Όλ μ΄μ.
λ λμ λλ 건 200κ° μΌμ΄μ€ μ€ 38κ°μμ '컀λ' μ ν©μ΄ λ°κ²¬λλ€λ μ μ΄μμ. νμ΅ λ°μ΄ν°μ ν¬ν¨λ ν¨μΉλ₯Ό κ·Έλλ‘ μ¬ννλλ°, μ¬μ§μ΄ λ¬Έμ μ μλ CVE λ²νΈκΉμ§ λ΅μ λΌμ΄ λμκ±°λ μ. κ²°κ΅ 95%μ 19%λ κ°μ λͺ¨λΈμ κ°λ¦¬ν€κ³ μμ΄μ. μ΄λ€ λ²€μΉλ§ν¬λ₯Ό λ―Ώμμ§λ³΄λ€, λ΄ μμ νκ²½μμ μ§μ ν μ€νΈν΄λ³΄λ κ² λ μ€μν΄μ§ μλμμ.
μ‘λμ€μ νλ§λ
μ μλ ν΄λΉ μ°κ΅¬μμ ν μ€νΈ νκ²½μ λ°μν΄μ. λ΄ μ€μ μ½λλ² μ΄μ€μμ μ΄λ»κ² λμνλμ§λ μ§μ νμΈν΄μΌ μ μ μμ΄μ.
μΆμ²: Claude Fable 5 Scores 95% on Its Own Benchmark and 19% on Real Security Work. The Gap Is the Lesson.