This Hacker Team Is Bulletproofing AI Models For Companies Like OpenAI And Anthropic

  • 📰 ForbesTech

Gray Swan AI News

AI Safety, AI Models, Security

Sarah Emerson is a senior writer who reports on technology companies and culture in Silicon Valley. She's broken news about the empires of billionaires such as Eric Schmidt and fallen billionaire Ryan Breslow. Sarah has also followed the trends and ideologies shaping today's AI zeitgeist.

The researchers behind Gray Swan AI started the company after finding a major vulnerability in models from OpenAI, Anthropic, Google and Meta. Now, they build products that help safeguard them.

The breakneck pace at which AI is evolving has created a vast ecosystem of new companies — some creating ever more powerful models, others identifying the threats that may accompany them. Gray Swan is among the latter, but takes it a step further by building safety and security measures for some of the issues it identifies. “We can actually provide the mechanisms by which you remove those risks, or at least mitigate them,” co-founder Zico Kolter told Forbes.

Looking forward, Gray Swan is keen on cultivating a community of hackers, and it’s not alone. At last year’s Defcon security conference, more than 2,000 people participated in an AI red-teaming challenge. AI companies often enlist internal and external red teamers to assess new models, and have announced official bug bounty programs that reward sleuths for exposing exploits around high-risk domains such as CBRN (chemical, biological, radiological and nuclear) threats. Independent findings, such as a reported vulnerability in Anthropic’s Claude 3.5 Sonnet, are also valuable resources for model developers.
