๐Ÿ› ๏ธAI ๋„๊ตฌ2026-06-19

๋‰ด์Šค - ์›๋ฌธ ๊ธฐ๋ฐ˜ ์š”์•ฝ ํ•„์š”

๐Ÿ’ก ํ•œ์ค„ ์š”์•ฝ|๋‰ด์Šค - ์›๋ฌธ ๊ธฐ๋ฐ˜ ์š”์•ฝ ํ•„์š”


title: "OpenAI, ์ƒ๋ช…๊ณผํ•™ AI ๋ฒค์น˜๋งˆํฌ LifeSciBench ๊ณต๊ฐœ" description: "๋‰ด์Šค - ์›๋ฌธ ๊ธฐ๋ฐ˜ ์š”์•ฝ ํ•„์š”" date: 2026-06-19 tags: [ai-tool] source: "https://www.marktechpost.com/2026/06/17/openai-releases-lifescibench-a-750-task-benchmark-grading-ai-models-on-real-life-science-research-with-expert-written-rubric/" sidebar: order: 0

์ œ๋ชฉ(ํ•œ๊ธ€): OpenAI, ์ƒ๋ช…๊ณผํ•™ AI ๋ฒค์น˜๋งˆํฌ LifeSciBench ๊ณต๊ฐœ ์›๋ฌธ ์ œ๋ชฉ(์˜๋ฌธ): OpenAI Releases LifeSciBench, a 750-Task Benchmark Grading AI Models on Real Life-Science Research With Expert-Written Rubric ์›๋ฌธ: OpenAI Releases LifeSciBench, a 750-Task Benchmark Grading AI Models on Real Life-Science Research With Expert-Written Rubric ์†Œ์Šค: marktechpost MD ํŒŒ์ผ: content/2026-06-19/marktechpost-openai-releases-lifescibench-a-750-task-benchmark-.md

ํ•ต์‹ฌ ๋‚ด์šฉ

OpenAI๊ฐ€ 750๊ฐœ ๊ณผ์ œ๋กœ ๊ตฌ์„ฑ๋œ ์ƒ๋ช…๊ณผํ•™ AI ๋ฒค์น˜๋งˆํฌ LifeSciBench๋ฅผ ๊ณต๊ฐœํ–ˆ์–ด์š”. ์ตœ๊ฐ• ๋ชจ๋ธ๋„ 3๊ฐœ ์ค‘ 1๊ฐœ ๊ณผ์ œ๋ฐ–์— ํ†ต๊ณผ ๋ชป ํ•  ๋งŒํผ ๋‚œ์ด๋„๊ฐ€ ๋†’๊ฑฐ๋“ ์š”.

173๋ช…์˜ ๋ฐ•์‚ฌ๊ธ‰ ๊ณผํ•™์ž๊ฐ€ ์ง์ ‘ ์ž‘์„ฑํ•œ ๊ณผ์ œ๋“ค์ด์—์š”. ๊ฒŒ๋†ˆํ•™ยท์˜์•ฝํ™”ํ•™ยท์ž„์ƒ๊ณผํ•™ ๋“ฑ 7๊ฐœ ๋ถ„์•ผ๋ฅผ ๋‹ค๋ฃจ๊ณ , ๊ณผ์ œ์˜ 79%๊ฐ€ ํ‰๊ท  4๋‹จ๊ณ„ ์ด์ƒ์˜ ์ถ”๋ก ์„ ์š”๊ตฌํ•ด์š”. ์ฃผ๊ด€์‹ ์„œ์ˆ ํ˜•์ด๋ผ ๋‹จ์ˆœ ์•”๊ธฐ๋กœ๋Š” ์ ˆ๋Œ€ ํ†ต๊ณผ ๋ชป ํ•ด์š”.

์ฑ„์ ๋„ ์ •๋ฐ€ํ•ด์š”. ๊ณผ์ œ๋‹น ํ‰๊ท  25๊ฐœ ๊ธฐ์ค€(์ด 19,020๊ฐœ)์œผ๋กœ ๋ถ€๋ถ„ ์ ์ˆ˜๋ฅผ ๋งค๊ธฐ๊ณ , 70% ์ด์ƒ ๋ฐ›์•„์•ผ ํ†ต๊ณผ์˜ˆ์š”. AI๊ฐ€ ์ƒ๋ช…๊ณผํ•™ ์‹ค๋ฌด์—์„œ ์ง„์งœ ์“ธ ๋งŒํ•œ์ง€ ๊ฐ€๋ฆฌ๋Š” ๊ธฐ์ค€์ด ์ƒ๊ธด ๊ฑฐ์˜ˆ์š”.

์žก๋Œ์Œค์˜ ํ•œ๋งˆ๋””

์ตœ๊ฐ• ๋ชจ๋ธ๋„ ํ†ต๊ณผ์œจ 33% ์ˆ˜์ค€์ด์—์š”. AI๊ฐ€ ์‹ค์ œ ์—ฐ๊ตฌ ํ˜„์žฅ์—์„œ ์“ธ ์ˆ˜ ์žˆ๋Š”์ง€ ๊ฐ€๋Š ํ•˜๋Š” ์ฒซ ๊ธฐ์ค€์ด ์ƒ๊ธด ๊ฑฐ์˜ˆ์š”.


์ถœ์ฒ˜: OpenAI Releases LifeSciBench, a 750-Task Benchmark Grading AI Models on Real Life-Science Research With Expert-Written Rubric

์ด ๊ธ€์ด ์–ด๋• ๋‚˜์š”?