Wednesday, April 16, 2025
HomeAIMicrosoft now hosts AI model accused of copying OpenAI data

Microsoft now hosts AI model accused of copying OpenAI data

Share

Fresh on the heels of a controversy in which ChatGPT-maker OpenAI accused the Chinese company behind DeepSeek R1 of using its AI model outputs against its terms of service, OpenAI’s largest investor, Microsoft, announced on Wednesday that it will now host DeepSeek R1 on its Azure cloud service.

DeepSeek R1 has been the talk of the AI world for the past week because it is a freely available simulated reasoning model that reportedly matches OpenAI’s o1 in performance—while allegedly being trained for a fraction of the cost.

Azure allows software developers to rent computing muscle from machines hosted in Microsoft-owned data centers, as well as rent access to software that runs on them.

“R1 offers a powerful, cost-efficient model that allows more users to harness state-of-the-art AI capabilities with minimal infrastructure investment,” wrote Microsoft Corporate Vice President Asha Sharma in a news release.

DeepSeek R1 runs at a fraction of the cost of o1, at least through each company’s own services. Comparative prices for R1 and o1 were not immediately available on Azure, but DeepSeek lists R1’s API cost as $2.19 per million output tokens, while OpenAI’s o1 costs $60 per million output tokens. That’s a massive discount for a model that performs similarly to o1-pro in various tasks.

Promoting a controversial AI model

On its face, the decision to host R1 on Microsoft servers is not unusual: The company offers access to over 1,800 models on its Azure AI Foundry service with the hopes of allowing software developers to experiment with various AI models and integrate them into their products. In some ways, whatever model they choose, Microsoft still wins because it’s being hosted on the company’s cloud service.

In another way, though, the move is a stamp of legitimacy on an AI model that has caused consternation for OpenAI over the past week. The controversy primarily centers on whether DeepSeek used OpenAI’s models to produce outputs (synthetic data) that DeepSeek then used to train or fine-tune its own models, a practice often called “distillation,” which is against OpenAI’s terms of service.

Since the launch of DeepSeek V3 (a large language model that served as the progenitor of R1), users have reported that the model often calls itself ChatGPT, which suggests that at least some ChatGPT-produced data was used to fine-tune V3’s behavior. It wouldn’t be the first time AI researchers have cribbed off of OpenAI: AI experts accused Elon Musk’s xAI of doing something similar with its Grok AI model in December 2023.

And that’s not the only issue at hand. In addition to the terms-of-service accusation and testy tweets from OpenAI employees, Microsoft also reportedly launched a probe into DeepSeek after Microsoft’s security researchers discovered that the Chinese company may have extracted substantial data for training purposes through OpenAI’s API during the fall of 2024, according to Bloomberg.

Despite the controversies, OpenAI CEO Sam Altman welcomed the additional competition from DeepSeek earlier this week. On Monday, Altman tweeted, “deepseek’s r1 is an impressive model, particularly around what they’re able to deliver for the price. we will obviously deliver much better models and also it’s legit invigorating to have a new competitor! we will pull up some releases.”

As a response to R1’s rise, OpenAI is expected to release o3-mini through ChatGPT as soon as later today.

Popular

Related Articles

Nvidia H20 chip exports hit with license requirement by US government

Semiconductor giant Nvidia is facing unexpected new U.S. export controls on its H20...

The Impact of AI on the Human Brain

Avi Loeb is the head of the Galileo Project, founding director of Harvard University’s — Black...

Notorious image board 4chan hacked and internal data leaked

Notorious internet forum 4chan was hacked on Tuesday.  At the time of...

Figuring Out What Lies Outside the Solar System is the Day Job of Astronomers, not Government

Figuring Out What Lies Outside the Solar System is the Day Job of Astronomers,...

Apple details how it plans to improve its AI models by privately analyzing user data

In the wake of criticism over the underwhelming performance of its AI products,...

Debates over AI benchmarking have reached Pokmon

Not even Pokémon is safe from AI benchmarking controversy. Last week,...

OpenAI plans to phase out GPT-4.5, its largest-ever AI model, from its API

OpenAI said on Monday that it would soon wind down the availability of...