Databricks, a San Francisco-based startup last valued at $38 billion, on Friday released open-source code that it said companies could use to create their own chatbots along the lines of OpenAI’s ChatGPT.
The code is an AI model, an algorithm that is trained on sets of data and can then learn from new data to perform a variety of tasks.
Databricks CEO Ali Ghodsi said the release was aimed at demonstrating a viable alternative to training a kind of AI model called a large language model with enormous resources and computing power.
A large language model underpins OpenAI’s viral chatbot ChatGPT. OpenAI, valued at $29 billion, trains its AI models with huge troves of data on a supercomputer from investor Microsoft Corp (MSFT.O). The computing costs are “eye-watering”, OpenAI CEO Sam Altman has said.
OpenAI charges business for access to its models for their own applications and has projected $1 billion in sales by 2024.
Databricks’ effort comes with caveats. Ghodsi told Reuters that, while the open-source chatbot displayed impressive capabilities at such tasks as drafting blog posts, the company had not released formal benchmark tests to show that the bot matched ChatGPT’s performance.
Databricks sells cloud-based data mining and analytics software to businesses and said last year it had surpassed $1 billion in annualized revenue.
Databricks wants enterprises to train their own AI models using its software. Ghodsi said the company’s researchers had taken a two-year-old model that was freely available and trained it with a small amount of data for three hours on single computer that anyone with a credit card could rent….