Ollama 提供与 OpenAI API 部分内容的实验兼容性,以帮助将现有应用程序连接到 Ollama。
介绍
OpenAI 兼容性是试验性的,可能会有重大调整,包括破坏性更改。有关 Ollama API 的全功能访问,请参阅 Ollama Python 库、JavaScript 库 和 REST API。
使用
OpenAI Python 库
from openai import OpenAI
client = OpenAI(
base_url='http://localhost:11434/v1/',
# required but ignored
api_key='ollama',
)
chat_completion = client.chat.completions.create(
messages=[
{
'role': 'user',
'content': 'Say this is a test',
}
],
model='llama3.2',
)
response = client.chat.completions.create(
model="llava",
messages=[
{
"role": "user",
"content": [
{"type": "text", "text": "What's in this image?"},
{
"type": "image_url",
"image_url": "",
},
],
}
],
max_tokens=300,
)
completion = client.completions.create(
model="llama3.2",
prompt="Say this is a test",
)
list_completion = client.models.list()
model = client.models.retrieve("llama3.2")
embeddings = client.embeddings.create(
model="all-minilm",
input=["why is the sky blue?", "why is the grass green?"],
)
结构化输出
from pydantic import BaseModel
from openai import OpenAI
client = OpenAI(base_url="http://localhost:11434/v1", api_key="ollama")
# Define the schema for the response
class FriendInfo(BaseModel):
name: str
age: int
is_available: bool
class FriendList(BaseModel):
friends: list[FriendInfo]
try:
completion = client.beta.chat.completions.parse(
temperature=0,
model="llama3.1:8b",
messages=[
{"role": "user", "content": "I have two friends. The first is Ollama 22 years old busy saving the world, and the second is Alonso 23 years old and wants to hang out. Return a list of friends in JSON format"}
],
response_format=FriendList,
)
friends_response = completion.choices[0].message
if friends_response.parsed:
print(friends_response.parsed)
elif friends_response.refusal:
print(friends_response.refusal)
except Exception as e:
print(f"Error: {e}")
OpenAI JavaScript 库
import OpenAI from 'openai';
const openai = new OpenAI({
baseURL: 'http://localhost:11434/v1/',
// required but ignored
apiKey: 'ollama'
});
const chatCompletion = await openai.chat.completions.create({
messages: [{ role: 'user', content: 'Say this is a test' }],
model: 'llama3.2'
});
const response = await openai.chat.completions.create({
model: 'llava',
messages: [
{
role: 'user',
content: [
{ type: 'text', text: "What's in this image?" },
{
type: 'image_url',
image_url:
''
}
]
}
]
});
const completion = await openai.completions.create({
model: 'llama3.2',
prompt: 'Say this is a test.'
});
const listCompletion = await openai.models.list();
const model = await openai.models.retrieve('llama3.2');
const embedding = await openai.embeddings.create({
model: 'all-minilm',
input: ['why is the sky blue?', 'why is the grass green?']
});
curl
curl http://localhost:11434/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"model": "llama3.2",
"messages": [
{
"role": "system",
"content": "You are a helpful assistant."
},
{
"role": "user",
"content": "Hello!"
}
]
}'
curl http://localhost:11434/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"model": "llava",
"messages": [
{
"role": "user",
"content": [
{
"type": "text",
"text": "What'\''s in this image?"
},
{
"type": "image_url",
"image_url": {
"url": ""
}
}
]
}
],
"max_tokens": 300
}'
curl http://localhost:11434/v1/completions \
-H "Content-Type: application/json" \
-d '{
"model": "llama3.2",
"prompt": "Say this is a test"
}'
curl http://localhost:11434/v1/models
curl http://localhost:11434/v1/models/llama3.2
curl http://localhost:11434/v1/embeddings \
-H "Content-Type: application/json" \
-d '{
"model": "all-minilm",
"input": ["why is the sky blue?", "why is the grass green?"]
}'
API 接口
/v1/chat/completions
支持的特性
支持的请求字段
/v1/completions
支持的特性
支持的请求字段
说明
/v1/models
说明
created
对应于模型最后一次修改的时间
owned_by
对应于 ollama 用户名,默认为 library
/v1/models/{model}
说明
created
对应于模型最后一次修改的时间
owned_by
对应于 ollama 用户名,默认为 library
/v1/embeddings
支持的请求字段
模型
使用模型前,先将其拉到本地 ollama pull
:
默认模型名称
对于依赖于默认 OpenAI 模型名称(如 gpt-3.5-turbo
)的工具,可使用 ollama cp
将现有模型名称复制到临时名称:
ollama cp llama3.2 gpt-3.5-turbo
之后,可以在 model
字段中指定这个新模型名称:
curl http://localhost:11434/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"model": "gpt-3.5-turbo",
"messages": [
{
"role": "user",
"content": "Hello!"
}
]
}'
设置上下文大小
OpenAI API 没有设置模型上下文大小的方法。如果您需要更改上下文大小,请创建一个 Modelfile
,其内容如下:
FROM <some model>
PARAMETER num_ctx <context size>
使用 ollama create mymodel
命令以更新的上下文大小创建新模型。使用更新后的模型名称调用 API:
curl http://localhost:11434/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"model": "mymodel",
"messages": [
{
"role": "user",
"content": "Hello!"
}
]
}'