Tech
Patronus AI lands $50M to build ‘digital worlds’ that stress-test AI agents
AI agents are becoming more sophisticated. They are evolving from answering questions to autonomously executing multi-step complex tasks.
But before these agents can be trusted to book trips or conduct financial analysis on behalf of users, model providers and the startups building such agents want to ensure that they perform reliably across a vast range of scenarios.
AI labs often use benchmarks to show off their model’s prowess, but a high score, even on an agent-oriented benchmark, doesn’t actually prove that an AI can accomplish various complex, real-world jobs correctly.
Patronus AI, a startup founded in 2023 by former Meta AI researchers Anand Kannappan and Rebecca Qian, is helping model makers and companies fine-tune models to do just that by building simulated digital environments in which to evaluate the agents’ performance.
The San Francisco-based startup must be solving an important problem. Virtually every frontier AI lab and many emerging startups are now customers, according to Glenn Solomon, a managing director at Notable Capital, who describes demand for the company’s simulated environments as nearly insatiable.
Patronus’ revenue has grown 15-fold over the past year, fueling significant investor interest. On Thursday, the company announced a $50 million Series B round led by Greenfield Partners, with participation from Notable Capital, Lightspeed, Datadog, and Samsung. The funding brings the company’s total funding to $70 million.
Patronus uses what it calls “digital world models” to create replicas of websites and internal systems. In these environments, agents are stress-tested after training using reinforcement learning, which iteratively rewards successful task completion and penalizes errors.
AI labs see great value in these digital simulations because they give agents a chance to try different, sometimes unpredictable, scenarios. The company compares its approach to how Waymo trained autonomous cars by first building synthetic worlds to test vehicles against rare hazards, such as severe weather or a child running after a ball.
The difference with AI agents is that they tend to take shortcuts, which means they fail to complete the task correctly. “Patronus is really good at spotting the hacks and making sure they are holding the models accountable,” Solomon said.
Patronus is currently providing its simulated digital worlds for software engineering and finance, but these are just the start, according to Kannappan.
“Today we’re very focused on the problems that are verifiable, so the problems that you can immediately check and verify, but there are a ton more areas that are very non-verifiable or very hard to verify,” he said.
Just because these processes are verifiable doesn’t mean they are simple. “We want to be able to actually create the environment in which you can operate an agent that can run for 10 hours or 10 days or 10 weeks,” Kannappan said.
As for rivals, Patronus believes it is primarily competing against the internal teams AI labs have already built to evaluate agent behavior. While human-data firms like Mercor and Surge help model makers with reinforcement learning, Patronus operates differently by evaluating how agents behave without any human involvement.
When you purchase through links in our articles, we may earn a small commission. This doesn’t affect our editorial independence.
>
Tech
Polymarket says hackers stole users’ funds
Prediction market giant Polymarket confirmed that hackers stole funds from an unspecified number of users after a third party breach.
In an X post on Thursday, Polymarket said that a compromise at a third party vendor allowed hackers to inject malicious code into its website “for some users.” The company said it has “contained” the incident and is now contacting the affected victims and “refunding them in full.”
As of Thursday afternoon, it’s unclear exactly what happened.
When reached by TechCrunch, Polymarket spokesperson Connor Brandi confirmed that the breach led to users’ funds being stolen, but declined to provide more information, nor respond to specific questions about the incident.
Around the same time as the Polymarket post, blockchain monitoring firm PeckShield reported on X that a phishing campaign was targeting Polymarket users. According to Peckshield, hackers had stolen around $3 million worth of cryptocurrency.
A blockchain analyst also reported similar losses and claimed that the funds were stolen from more than 11 victims.
Polymarket offers users the possibility of being paid in cryptocurrency.
In the last couple of days, two people on social media claimed to have had their Polymarket funds stolen.
The hack is the latest blow for a company that has been in the headlines for the wrong reasons this week. On Sunday, an investigation revealed that Polymarket had paid online creators to post deceptive videos showing they won lucrative bets that were actually fake. In response, the company said it would audit its promotional content.
>
Tech
Interpol: Cybercrime Hits 30% of Recorded Crime in Surveyed APAC Countries
Interpol’s latest Asia and South Pacific cybercrime assessment shows how phishing, ransomware, DDoS attacks, infostealers, and AI-enabled scams are raising security risks across APAC.
The post Interpol: Cybercrime Hits 30% of Recorded Crime in Surveyed APAC Countries appeared first on TechRepublic.
>
Tech
Denmark Ordered to Pay $12M Over Huawei Equipment Removal
A Danish court ordered the state to compensate TDC NET after the removal of Huawei fiber-network equipment, raising questions about telecom security costs.
The post Denmark Ordered to Pay $12M Over Huawei Equipment Removal appeared first on TechRepublic.
>
-
Fashion9 years ago
These ’90s fashion trends are making a comeback in 2017
-
Fashion9 years ago
According to Dior Couture, this taboo fashion accessory is back
-
Fashion9 years ago
Model Jocelyn Chew’s Instagram is the best vacation you’ve ever had
-
Fashion9 years ago
Your comprehensive guide to this fall’s biggest trends
-
Fashion9 years ago
A photo diary of the nightlife scene from LA To Ibiza
-
Fashion9 years ago
Emily Ratajkowski channels back-to-school style
-
Fashion9 years ago
9 Celebrities who have spoken out about being photoshopped
-
Fashion9 years ago
The tremendous importance of owning a perfect piece of clothing