How to Install run EmbeddingGemma 300m locally
Step-by-step walkthrough: How to Install run EmbeddingGemma 300m locally — install, configure, and run with notes from hands-on testing.
Filter by tag or browse the full feed. New blogs appear here as soon as they're written.
Step-by-step walkthrough: How to Install run EmbeddingGemma 300m locally — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: The Open Source App Builder that Ate SaaS Dyad Ollama Setup — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: Zero to Docs Hero Create a Python Documentation Generator with GPT 5 — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How i Built a GPT OSS 120b Parameter Coding Beast that Reviews Fixes and Writes — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Install Devstral Small 11 locally — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: LLMs Under Fire Red Teaming with Deepteam Ollama — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Install DeepSeek Nano vLLM locally — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Install Fanar 1 9b Arabic English LLM locally — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Install and run Sarvam M locally — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Install NVIDIA Acereason Nemotron 14b locally — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Install NanoVLM Worlds Smallest Model locally — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Install SmolDocling 256m Preview locally — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Install Qwen3 32b GGUF locally — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Install Meta Perception Lm 8b locally — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Install NVIDIA Parakeet Tdt 06b V2 locally — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Install Nari Dia 16 B locally — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Install Falcon 3 locally — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Install Google PaliGemma 2 locally — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: Run Langtrace Open Source Observability Tool for LLM Applications — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: Running AI Models with Open WebUI — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: 40 Linux Commands You Need to Know the Ultimate Guide for Ubuntu Users — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: Best Low Code Platforms for Building Applications in 2024 — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Deploy Llama 31 405b in the cloud — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Install and run AuraFlow Image Generator locally — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Deploy Molmo 7b D 0924 in the cloud — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Configure WireGuard VPN in the cloud — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Deploy Llama 31 Nemotron 70b Instruct on a Virtual Machine in the cloud — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Deploy Pixtral 12b in the cloud — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: Running a Dedicated Ethereum RPC Node in a Virtual Machine — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Deploy Granite Moe 1b and 3b in the cloud — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Deploy Solar Pro 22b in the cloud — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Deploy Granite Dense 2b and 8b on a Virtual Machine in the cloud — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to run for Inference Llama 31 Nemotron 51b Instruct — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Install Minikube on Ubuntu Virtual Machine — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Deploy InternVL2 2b in the cloud — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Deploy SmolLM2 17b on a Virtual Machine in the cloud with Ollama — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: How to Install run Microsoft Kosmos 25 locally — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: DeepSeek V31 Meets Promptfoo Jailbreaks Biases Beyond — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: The GPT 5 Paradox Genius in Thought Gaps in Safety — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: Reproducible LLM Benchmarking GPT 5 vs Grok 4 with Promptfoo — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: The OCR Model that Outranks GPT 4o — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: The One Click GPT 5 Code Machine how i Built my Own AI Developer — install, configure, and run with notes from hands-on testing.
Step-by-step walkthrough: 100 Game Changing Chatgpt Prompts for Developers Product Managers Designers and — install, configure, and run with notes from hands-on testing.
A complete walkthrough of Medusa — scanning AI agents, MCP servers, and LLM apps for misconfigurations, prompt injection surfaces, and deployment risks.
Architecting a multi-agent trading stack — research agents, risk agents, execution agents, and the orchestration layer that makes them behave like a desk.
How autonomous AI pentesting actually works in 2026 — tooling, workflows, guardrails, and what to expect when you point an agent at your own perimeter.
The OSINT stack I reach for in 2026 — hacker search engines, dorking patterns, and practical recon workflows for security researchers.
28 purpose-built subagents for offensive security — recon, exploit chaining, reporting — all orchestrated through Claude Code.
Twenty-five OSS security tools that punch above their price tag — free, battle-tested, and enough to run a lean security program.
Ten repos that cover the full solo-engineer surface area — from CI to observability to AI agents — without hiring a team.
Six agent frameworks that are quietly reshaping how teams ship — not demos, but tools people actually run in production.
Swapping cloud Codex for a fully local Gemma 4 + Ollama coding agent — setup, prompts, and where it wins (and where it doesn't).
Part two of the agent framework roundup — newer entrants, architecture patterns, and which ones are worth betting on this year.
Every serious AI browser and web-agent option in 2026 — compared on privacy, autonomy, extensibility, and real-world usefulness.
Multica as an OSS answer to managed Claude agents — architecture, setup, and how it compares for teams that want control.
A hands-on tour of Mastra — workflows, memory, tool calling, and the patterns that make agents production-grade instead of demo-grade.
AMD's GAIA stack for running intelligent workloads locally — hardware sizing, install, and the use cases where on-device wins.
Nineteen tools for tracing, evaluating, and debugging LLM agents in production — with notes on when each one actually earns its keep.
Underrated agent frameworks hiding in plain sight — why they matter, what they do differently, and links to get started.
Archinstall 4.0 makes Arch approachable again — full feature tour, install walkthrough, and the defaults I'd change on day one.
CAI and the shift toward AI-native security automation — what it automates, what it can't, and how teams should adopt it.
AgentScope for multi-agent apps that survive contact with reality — roles, messaging, failure handling, and a working example.
How OpenViking rethinks agent memory beyond flat vector stores — context layers, retrieval, and why it matters for long-running agents.
Strix as an autonomous offensive testing tool — setup, attack surfaces it finds, and how to run it safely against your own apps.
The mobile pentest toolchain for 2026 — static, dynamic, network, and the workflows that tie them together on real engagements.
Eight OSS tools bringing AI into the pentest workflow — recon, exploitation assist, reporting — and how to evaluate them responsibly.
Hindsight's approach to durable agent memory — episodic recall, structured context, and why vectors alone aren't enough.
How OpenClaw grew from a weekend project into a self-hosted personal AI platform — architecture, philosophy, and getting started.
Eleven serving engines compared for production — throughput, ops burden, GPU efficiency, and which I'd pick for each workload.
Twenty OSS projects that form a complete agent-building arsenal — orchestration, tools, memory, eval, and deployment.
The Model Context Protocol ecosystem has exploded. Here are the 10 open-source MCP servers I'd put in every agent stack going into 2026.
A field guide to running production-grade autonomous agents on zero API spend — 20 hand-picked open-source tools, real workflows, and the gotchas nobody mentions.
I spent a week trying to break Moonshot's Kimi K2 with Promptfoo. Here's where it cracked, where it surprised me, and what I'd put in a red-team playbook.
Pointing Promptfoo at Qwen3-Coder 480B-A35B revealed a surprising attack surface. Walking through the audit, the findings, and the patterns that repeat across frontier coder models.
A repeatable playbook for security-auditing any LLM — demonstrated end-to-end on GLM-4.5 using Promptfoo. Test plans, harnesses, and how to read the signal.
Setting up DeepTeam against local Ollama models to stress-test prompt injection, jailbreaks, and harmful-output rates — all without leaving your laptop.
Four major open-source LLM red-teaming frameworks, one merciless head-to-head. Coverage, ergonomics, plugin ecosystem, and which I'd actually ship with.
DeepSeek-R1 is impressive — and a great red-team target. Here's the full Promptfoo + Ollama harness I used to stress-test reasoning, refusals, and exploits.
Side-by-side visual evaluations of OpenAI hosted models and local Ollama runs — how to build a faithful eval harness when ground truth is fuzzy.
An honest engineering diary on shipping an AI book-writing pipeline — Agent Communication Protocol, evaluator harness, and the failure modes that surprised me.
OpenAI shipped GPT-OSS — and yes, you can run it on your own metal. The complete setup, hardware sizing, and the inference tricks that matter.
Going beyond the base GPT-OSS — running the 20B and full 120B GGUF builds locally with sane quantization, sane VRAM, and reproducible benchmarks.
Wrangling Qwen3-Coder's 480B MoE on local hardware: weight layout, KV-cache strategies, and the dev workflow that actually keeps the model fast.
DeepSeek V3.1 in GGUF form is a strong daily driver. Hardware sizing, quant trade-offs, and an end-to-end install walkthrough.
Side-by-side benchmarks of Claude 4 Opus and Sonnet, with a practical Claude Code workflow that gets the most out of each tier.
OpenCode is the open-source coding agent that finally feels like Claude Code. A complete hands-on guide — install, configure, hook up local models.
Void is an open-source Cursor-style editor. Wired to Ollama, it becomes a local AI workbench that doesn't phone home. Full setup inside.
Zed is fast, Ollama is portable, and a GPU VM is finally cheap. Here's the dev setup I run all day — IDE, model server, and the network glue.
OpenHands lets you ship working apps with an autonomous engineer agent. The install, the workflows, and the prompts I've found ship 10x more reliably.
Llama 4 is here — and it's tool-calling-native. Setting it up locally end-to-end, with a working agentic loop and traces you can inspect.
Llama 3.3 70B Instruct is the sweet-spot OSS model for most workloads. Full local-install guide with quantization and serving tips.
A complete tour of the DeepSeek lineup — Tiny, Small, and VL2 — with reproducible inference scripts and a Gradio UI for quick experiments.
DeepSeek Janus-Pro 7B is a strong multimodal model. The full install, weights layout, and a minimal inference example you can build on.
A pragmatic cloud-deploy walkthrough for Qwen2.5-Coder-32B-Instruct — sizing, serving, and the cost-vs-throughput knobs that matter.
Combine Ollama with Web-LLM to build a local AI search assistant — private, fast, and bookmark-friendly. The full architecture and code inside.
Mistral Magistral is the model I keep coming back to for reasoning tasks. A clean, reproducible local-install guide with serving notes.
Mistral Voxtral brings speech-native intelligence to local stacks. A complete install + first-app guide with audio pipelines.
Tencent's Hunyuan3D-World 1.0 turns text into navigable 3D scenes. The local install, GPU sizing, and a first scene-generation walkthrough.
FLUX.1 Kontext Dev is a context-aware image model that punches above its weight. Full local install and a workflow notebook to copy.
Running FLUX.1 Schnell and Pro on a GPU VM with sensible defaults, queueing, and a UI you can put in front of a team.
An end-to-end build of a chest X-ray analyzer using Google's MedGemma 27B and Gradio — model loading, preprocessing, and a usable clinician-facing UI.
A complete walkthrough of building a production-grade time-series forecaster on top of Datadog's open Toto Base 1.0 — schemas, batching, deployment.
VulHunt — an open-source framework for hunting vulnerabilities at scale. Architecture, plugin model, and how I use it on bug-bounty engagements.
The 50 BlackArch tools I reach for most on penetration tests — what each does, when to use it, and the command patterns I keep cheat-sheeted.
Beyond the terminal — the 15 Kali GUI tools every security engineer should be fluent with, each illustrated with a real engagement scenario.
Operating a dedicated Ethereum RPC node on a VM — disk planning, snapshot tactics, peering, and the monitoring that catches bad sync states early.
A complete backup architecture for VMs into S3-compatible object storage — Borg + Borgmatic + Rclone + cron rsync, with the restore drill I run quarterly.