ollam 提供兼容 OpenAI 格式的API

发布时间: 更新时间: 总字数:961 阅读时间:2m 作者: IP上海 分享 网址

Ollama 提供与 OpenAI API 部分内容的实验兼容性,以帮助将现有应用程序连接到 Ollama。

介绍

OpenAI 兼容性是试验性的,可能会有重大调整,包括破坏性更改。有关 Ollama API 的全功能访问,请参阅 Ollama Python 库JavaScript 库REST API

使用

OpenAI Python 库

from openai import OpenAI

client = OpenAI(
    base_url='http://localhost:11434/v1/',

    # required but ignored
    api_key='ollama',
)

chat_completion = client.chat.completions.create(
    messages=[
        {
            'role': 'user',
            'content': 'Say this is a test',
        }
    ],
    model='llama3.2',
)

response = client.chat.completions.create(
    model="llava",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "What's in this image?"},
                {
                    "type": "image_url",
                    "image_url": "",
                },
            ],
        }
    ],
    max_tokens=300,
)

completion = client.completions.create(
    model="llama3.2",
    prompt="Say this is a test",
)

list_completion = client.models.list()

model = client.models.retrieve("llama3.2")

embeddings = client.embeddings.create(
    model="all-minilm",
    input=["why is the sky blue?", "why is the grass green?"],
)

结构化输出

from pydantic import BaseModel
from openai import OpenAI

client = OpenAI(base_url="http://localhost:11434/v1", api_key="ollama")

# Define the schema for the response
class FriendInfo(BaseModel):
    name: str
    age: int
    is_available: bool

class FriendList(BaseModel):
    friends: list[FriendInfo]

try:
    completion = client.beta.chat.completions.parse(
        temperature=0,
        model="llama3.1:8b",
        messages=[
            {"role": "user", "content": "I have two friends. The first is Ollama 22 years old busy saving the world, and the second is Alonso 23 years old and wants to hang out. Return a list of friends in JSON format"}
        ],
        response_format=FriendList,
    )

    friends_response = completion.choices[0].message
    if friends_response.parsed:
        print(friends_response.parsed)
    elif friends_response.refusal:
        print(friends_response.refusal)
except Exception as e:
    print(f"Error: {e}")

OpenAI JavaScript 库

import OpenAI from 'openai';

const openai = new OpenAI({
  baseURL: 'http://localhost:11434/v1/',

  // required but ignored
  apiKey: 'ollama'
});

const chatCompletion = await openai.chat.completions.create({
  messages: [{ role: 'user', content: 'Say this is a test' }],
  model: 'llama3.2'
});

const response = await openai.chat.completions.create({
  model: 'llava',
  messages: [
    {
      role: 'user',
      content: [
        { type: 'text', text: "What's in this image?" },
        {
          type: 'image_url',
          image_url:
            ''
        }
      ]
    }
  ]
});

const completion = await openai.completions.create({
  model: 'llama3.2',
  prompt: 'Say this is a test.'
});

const listCompletion = await openai.models.list();

const model = await openai.models.retrieve('llama3.2');

const embedding = await openai.embeddings.create({
  model: 'all-minilm',
  input: ['why is the sky blue?', 'why is the grass green?']
});

curl

curl http://localhost:11434/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{
        "model": "llama3.2",
        "messages": [
            {
                "role": "system",
                "content": "You are a helpful assistant."
            },
            {
                "role": "user",
                "content": "Hello!"
            }
        ]
    }'

curl http://localhost:11434/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llava",
    "messages": [
      {
        "role": "user",
        "content": [
          {
            "type": "text",
            "text": "What'\''s in this image?"
          },
          {
            "type": "image_url",
            "image_url": {
               "url": ""
            }
          }
        ]
      }
    ],
    "max_tokens": 300
  }'

curl http://localhost:11434/v1/completions \
    -H "Content-Type: application/json" \
    -d '{
        "model": "llama3.2",
        "prompt": "Say this is a test"
    }'

curl http://localhost:11434/v1/models

curl http://localhost:11434/v1/models/llama3.2

curl http://localhost:11434/v1/embeddings \
    -H "Content-Type: application/json" \
    -d '{
        "model": "all-minilm",
        "input": ["why is the sky blue?", "why is the grass green?"]
    }'

API 接口

/v1/chat/completions

支持的特性

  • Chat completions
  • Streaming
  • JSON mode
  • Reproducible outputs
  • Vision
  • Tools
  • Logprobs

支持的请求字段

  • model
  • messages
    • Text content
    • Image content
      • Base64 encoded image
      • Image URL
    • Array of content parts
  • frequency_penalty
  • presence_penalty
  • response_format
  • seed
  • stop
  • stream
  • stream_options
    • include_usage
  • temperature
  • top_p
  • max_tokens
  • tools
  • tool_choice
  • logit_bias
  • user
  • n

/v1/completions

支持的特性

  • Completions
  • Streaming
  • JSON mode
  • Reproducible outputs
  • Logprobs

支持的请求字段

  • model
  • prompt
  • frequency_penalty
  • presence_penalty
  • seed
  • stop
  • stream
  • stream_options
    • include_usage
  • temperature
  • top_p
  • max_tokens
  • suffix
  • best_of
  • echo
  • logit_bias
  • user
  • n

说明

  • prompt 目前只接受一个字符串

/v1/models

说明

  • created 对应于模型最后一次修改的时间
  • owned_by 对应于 ollama 用户名,默认为 library

/v1/models/{model}

说明

  • created 对应于模型最后一次修改的时间
  • owned_by 对应于 ollama 用户名,默认为 library

/v1/embeddings

支持的请求字段

  • model
  • input
    • string
    • array of strings
    • array of tokens
    • array of token arrays
  • encoding format
  • dimensions
  • user

模型

使用模型前,先将其拉到本地 ollama pull:

ollama pull llama3.2

默认模型名称

对于依赖于默认 OpenAI 模型名称(如 gpt-3.5-turbo )的工具,可使用 ollama cp 将现有模型名称复制到临时名称:

ollama cp llama3.2 gpt-3.5-turbo

之后,可以在 model 字段中指定这个新模型名称:

curl http://localhost:11434/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{
        "model": "gpt-3.5-turbo",
        "messages": [
            {
                "role": "user",
                "content": "Hello!"
            }
        ]
    }'

设置上下文大小

OpenAI API 没有设置模型上下文大小的方法。如果您需要更改上下文大小,请创建一个 Modelfile,其内容如下:

FROM <some model>
PARAMETER num_ctx <context size>

使用 ollama create mymodel 命令以更新的上下文大小创建新模型。使用更新后的模型名称调用 API:

curl http://localhost:11434/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{
        "model": "mymodel",
        "messages": [
            {
                "role": "user",
                "content": "Hello!"
            }
        ]
    }'

参考

  1. https://github.com/ollama/ollama/blob/v0.5.7/docs/openai.md
Home Archives Categories Tags Statistics
本文总阅读量 次 本站总访问量 次 本站总访客数