ollama

$ npx mdskill add mkurman/zorai/ollama

Run 100+ local LLMs with GPU acceleration and custom configs.

  • Executes Llama, Mistral, and other models locally without cloud costs.
  • Uses OpenAI-compatible API and supports custom Modelfile configurations.
  • Leverages CUDA and Metal for hardware-accelerated inference performance.
  • Delivers text responses via command line or Python client integration.

---
name: ollama
description: "Local LLM runner. One-command setup for Llama, Mistral, Gemma, Qwen, DeepSeek, Phi, and 100+ models. OpenAI-compatible API, model management, GPU acceleration, and custom Modelfile creation."
tags: [ollama, local-llm, llm, inference, api, modelfile, zorai]
---
## Overview

Ollama runs LLMs locally with a single command. It supports Llama 3, Mistral, Gemma, Qwen 2.5, DeepSeek, Phi, and 100+ other models, with GPU acceleration (CUDA/Metal), an OpenAI-compatible API, and custom Modelfiles for configuration.

## Installation

```bash
# macOS / Linux
curl -fsSL https://ollama.com/install.sh | sh
```

## Basic Usage

```bash
ollama pull llama3.1:8b
ollama run llama3.1:8b "Explain quantum computing"
```
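
The two commands above can also be driven from a script. The following is a minimal sketch, assuming the `ollama` CLI is installed and on `PATH`; the helper names (`pull_command`, `run_command`, `run_prompt`) are hypothetical and only wrap the commands shown above with Python's standard `subprocess` module.

```python
import shutil
import subprocess


def pull_command(model):
    """Build the argument list for `ollama pull <model>`."""
    return ["ollama", "pull", model]


def run_command(model, prompt):
    """Build the argument list for `ollama run <model> "<prompt>"`."""
    return ["ollama", "run", model, prompt]


def run_prompt(model, prompt):
    """Pull the model if needed, run one prompt, and return the reply text.

    Hypothetical helper: assumes the ollama CLI is on PATH.
    """
    if shutil.which("ollama") is None:
        raise RuntimeError("ollama CLI not found on PATH")
    # check=True raises CalledProcessError if either command fails.
    subprocess.run(pull_command(model), check=True)
    result = subprocess.run(
        run_command(model, prompt),
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout.strip()
```

With Ollama installed, `run_prompt("llama3.1:8b", "Explain quantum computing")` would return the model's answer as a string. Passing the arguments as a list avoids shell quoting issues in the prompt.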

## Python API

```python
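import openai

# Ollama serves an OpenAI-compatible API under /v1 on its default port, 11434.
# The client requires an api_key value, but Ollama ignores it.
client = openai.OpenAI(base_url="http://localhost:11434/v1", api_key="ollama")
resp = client.chat.completions.create(
    model="llama3.1:8b",  # must already be pulled locally
    messages=[{"role": "user", "content": "What is ML?"}],
)
print(resp.choices[0].message.content)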
```
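
For longer answers it can help to print tokens as they arrive. The sketch below builds on the client above and assumes the same local endpoint; `build_chat_request` and `stream_text` are hypothetical helper names, and the optional system prompt is an illustration rather than something Ollama requires. Streaming relies on the `stream=True` argument of the OpenAI Python client.

```python
def build_chat_request(model, prompt, system=None):
    """Assemble keyword arguments for client.chat.completions.create."""
    messages = []
    if system:
        messages.append({"role": "system", "content": system})
    messages.append({"role": "user", "content": prompt})
    return {"model": model, "messages": messages, "stream": True}


def stream_text(client, request):
    """Yield text fragments from a streaming chat completion."""
    for chunk in client.chat.completions.create(**request):
        if not chunk.choices:
            continue  # some chunks carry no choices
        delta = chunk.choices[0].delta.content
        if delta:  # skip empty or None fragments
            yield delta


def demo():
    """Not called automatically; needs the openai package and a running server."""
    import openai

    client = openai.OpenAI(base_url="http://localhost:11434/v1", api_key="ollama")
    request = build_chat_request("llama3.1:8b", "What is ML?")
    for piece in stream_text(client, request):
        print(piece, end="", flush=True)
    print()
```

Calling `demo()` with the server running prints the reply incrementally instead of waiting for the full response.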

## Custom Modelfile

```dockerfile
FROM llama3.1:8b
PARAMETER temperature 0.3
SYSTEM "You are a medical coding assistant."
```

```bash
ollama create my-coder -f Modelfile
```
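
When several variants of a model are needed, the Modelfile can be generated rather than hand-written. This is a minimal sketch, assuming only the three instructions shown above (`FROM`, `PARAMETER temperature`, `SYSTEM`); `render_modelfile` and `create_model` are hypothetical helpers, and the sketch rejects system prompts containing double quotes or newlines instead of trying to escape them.

```python
import subprocess
import tempfile
from pathlib import Path


def render_modelfile(base, system, temperature):
    """Return Modelfile text with FROM, PARAMETER temperature, and SYSTEM."""
    if '"' in system or "\n" in system:
        # Assumption: keep the sketch simple and refuse prompts needing escaping.
        raise ValueError("system prompt must be a single line without double quotes")
    return (
        f"FROM {base}\n"
        f"PARAMETER temperature {temperature}\n"
        f'SYSTEM "{system}"\n'
    )


def create_model(name, base, system, temperature):
    """Write a temporary Modelfile and run `ollama create <name> -f <path>`."""
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "Modelfile"
        path.write_text(render_modelfile(base, system, temperature))
        # Requires the ollama CLI on PATH; raises CalledProcessError on failure.
        subprocess.run(["ollama", "create", name, "-f", str(path)], check=True)
```

For example, `create_model("my-coder", "llama3.1:8b", "You are a medical coding assistant.", 0.3)` would reproduce the Modelfile and `ollama create` command above.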

## References
- [Ollama docs](https://github.com/ollama/ollama)
- [Ollama library](https://ollama.com/library)
