ollama 使用问题总结分享 (持续更新)

caicongyang

2500人浏览 · 2024-06-27 19:40:41

caicongyang · 2024-06-27 19:40:41 发布

1. ollama 无法使用所有的cpu ? Ollama is only using 4 of my 8 Cores

Ollama does not have an option parameter to define the number of CPUs (it does for GPUs), but you can try setting num_threads to a value much higher than 8 (default value), and see how it works for you:

curl --location 'http://127.0.0.1:11434/api/chat' \
--header 'Content-Type: application/json' \
--data '{
  "model": "llama3",
  "messages": [
    { "role": "user", "content": "why is the sky blue?" }
  ],
  "options":{
    "num_thread": 16
  }
}'

https://github.com/ollama/ollama/blob/main/docs/api.md#generate-request-with-options

2. ollama 使用定制模型

2.1 使用llama.cpp转换成gguf 格式参考

https://github.com/ggerganov/llama.cpp/discussions/2948
2.2 使用ollmma 部署通用模型参加

https://github.com/ollama/ollama?tab=readme-ov-file#customize-a-model

2.3 发布提供给别人使用

https://ollama.com/caicongyang/Qwen2.5-1.5B-Instruct-web3

https://edu.csdn.net/learn/39067/627173?utm_source=2019755004

汇聚全球AI编程工具，助力开发者即刻编程。

更多推荐

打通OpenClaw与Claude Code会话链路，实现AI全自动编码落地

这套双AI智能体协同方案，完美解决当前AI编码工具半自动、会话中断、上下文丢失三大落地痛点，中小型研发团队可直接上线承接日常后端、前端全品类开发需求，把重复性编码、任务跟进、工单提报工作全部托管给AI集群。在官方原生多Agent调度能力完善之前，该方案是兼容性、稳定性、落地成本最优的无人AI开发实施方案。

AI编程社区

2026 年 7 月：怎么冲 GPT 会员？先分清 ChatGPT Plus、Pro、Codex 和 API

AI编程社区

用 Ace Data Cloud 快速接入 OpenAI Chat Completion API：对话、流式输出、多轮上下文一次搞定

# 用 Ace Data Cloud 快速接入 OpenAI Chat Completion API：对话、流式输出、多轮上下文一次搞定想把 ChatGPT 能力接入自己的产品，但又不想在账号、额度、模型切换、接口调试和多语言对接上反复折腾？如果你正在开发智能客服、内容生成、代码助手、知识库问答、数据分析助手，或者任何需要大模型对话能力的应用，**Ace Data Cloud 的 OpenAI