OpenAI, the artificial intelligence research company, announced on Thursday a new generation of embedding models, which can convert text into a numerical form that can be used for various machine learning tasks. The company also introduced new versions of its GPT-4 Turbo and moderation models, new API usage management tools, and lower pricing on its GPT-3.5 Turbo model.

Embeddings are sequences of numbers that represent the concepts within content such as natural language or code. Embeddings make it easy for machine learning models and other algorithms to understand the relationships between content and to perform tasks like clustering or retrieval. They power applications like knowledge retrieval in both ChatGPT and the Assistants API, and many retrieval augmented generation (RAG) developer tools.

OpenAI said that its new embedding models, text-embedding-3-small and text-embedding-3-large, offer stronger performance and reduced price compared to its previous generation model, text-embedding-ada-002. The new models can create embeddings with up to 3072 dimensions, which can capture more semantic information and improve the accuracy of downstream tasks.

According to the company, the new models have increased the average score on a commonly used benchmark for multi-language retrieval (MIRACL) from 31.4% to 54.9%, while the average score on a commonly used benchmark for English tasks (MTEB) has increased from 61.0% to 64.6%. The pricing for text-embedding-3-small has also been reduced by 5X compared to text-embedding-ada-002, making it more affordable for developers to use.

The company also updated its GPT-4 Turbo and GPT-3.5 Turbo models, which are large multimodal models that can understand and generate natural language or code. The new versions of the models come with improved instruction following, JSON mode, more reproducible outputs, and parallel function calling. The company also launched a new 16k context version of GPT-3.5 Turbo, which can process longer inputs and outputs than the standard 4k version.

Additionally, the company updated its text moderation model, which can detect whether text may be sensitive or unsafe. The new version of the model can handle more languages and domains, and can also provide explanations for its predictions.

The company also introduced new ways for developers to manage API keys and understand API usage. Developers can now create multiple API keys with different permissions and scopes, and monitor their usage and billing details on the OpenAI Dashboard. The company also said that it will soon lower the pricing on its GPT-3.5 Turbo model by 25%, making it more accessible for developers to build applications with it.

OpenAI said that these updates are part of its continuous efforts to improve its models and services, and to make them more useful and affordable for developers and customers. The company also invited developers to contribute evaluations to help it improve the model for different use cases. The company said that it will continue to release new models, features, and tools in the future.

VentureBeat’s mission is to be a digital town square for technical decision-makers to gain knowledge about transformative enterprise technology and transact. Discover our Briefings.


Michael Nuñez

Source link

You May Also Like

Make File Navigating a Breeze with This Convenient Software for Mac or Windows | Entrepreneur

Disclosure: Our goal is to feature products and services that we think…

This bond ETF is a bet on ‘higher for longer’ and uses options to find extra income

Wall Street keeps pushing back its predictions for recession and interest rate…

Bed Bath & Beyond plans to sell $300 million in stock, again warns of bankruptcy By Reuters

© Reuters. FILE PHOTO: A shopping cart is seen at a Bed…

US casinos top $60B in revenue in 2022, their best year ever | Long Island Business News

Commercial casinos in the United States won more than $60 billion from…