title: "ν΄λ‘λ μ€νΌμ€ 4.8, λ²€μΉλ§ν¬ μ λ" description: "λ΄μ€ - μλ¬Έ κΈ°λ° μμ½ νμ" date: 2026-05-29 tags: [ai-news] source: "https://the-decoder.com/anthropic-ships-claude-opus-4-8-as-a-modest-but-tangible-improvement-that-tops-gpt-5-5-in-most-benchmarks/" sidebar: order: 0
μ λͺ©(νκΈ): ν΄λ‘λ μ€νΌμ€ 4.8, λ²€μΉλ§ν¬ μ λ μλ¬Έ μ λͺ©(μλ¬Έ): Anthropic ships Claude Opus 4.8 as a "modest but tangible improvement" that tops GPT-5.5 in most benchmarks μλ¬Έ: Anthropic ships Claude Opus 4.8 as a "modest but tangible improvement" that tops GPT-5.5 in most benchmarks μμ€: the-decoder MD νμΌ: content/2026-05-29/the-decoder-anthropic-ships-claude-opus-4-8-as-a-modest-but-ta.md
ν΅μ¬ λ΄μ©
Anthropicμ΄ Claude Opus 4.8μ 곡κ°νκ³ , λλΆλΆ λ²€μΉλ§ν¬μμ GPT-5.5μ Gemini 3.1 Proλ₯Ό μμ°μ΄μ.
μμ΄μ ν± μ½λ©(SWE-Bench Pro)μ 69.2%λ‘ Opus 4.7μ 64.3%, GPT-5.5μ 58.6%λ³΄λ€ λμμ΄μ. Humanity's Last Examμ λꡬ μμ΄ 49.8%, λꡬ μ¬μ© μ 57.9%λ‘ μ΅κ³ μ μλ₯Ό κΈ°λ‘νμ΄μ.
Anthropicμ νΉν μ μ§μ± κ°μ μ κ°μ‘°νμ΄μ. μ΄κΈ° ν μ€ν° κΈ°μ€μΌλ‘ λΆνμ€μ±μ λ μμ£Ό λ°νκ³ , κ·Όκ±° μλ μ£Όμ₯λ μ€μκ³ μ. μ체 μ½λ© νκ°μμ λ²κ·Έλ₯Ό κ·Έλ₯ λκΈ°λ λΉμ¨μ΄ 4.7 λλΉ μ½ 4λ°° κ°μνλ€κ³ λ°νμ΄μ.
λͺ¨λΈ μ±λ₯λ ν¬μ§λ§, ν μΈμ μμ μλ°± κ° λ³λ ¬ μλΈμμ΄μ νΈλ₯Ό λ리λ λμ μν¬νλ‘μ°κ° μ€μ μ 무 μλνμ μ²΄κ° λ³νλ₯Ό ν€μΈ ν¬μΈνΈμμ.
μ‘λμ€μ νλ§λ
λ²κ·Έλ₯Ό λμΉκ³ λ μ§μ²μ²λΌ λ§νλ λΉλκ° 4λ°° μ€μλ€κ³ ν΄μ. λμ μν¬νλ‘μ°λ‘ λκ·λͺ¨ μ½λ λ§μ΄κ·Έλ μ΄μ μλνλ λ Έλ¦΄ μ μμ΄μ.