What is the memory usage of different models?

The memory usage varies by model size. For example, the tiny model uses ~273 MB, the base model uses ~388 MB, and the large model uses ~3.9 GB.
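As a rough illustration of how these figures might be used, the sketch below picks the largest listed model that fits a given memory budget. The function name pick_model is hypothetical and not part of whisper.cpp, and 3900 MB is a rounded stand-in for the ~3.9 GB figure. Actual usage may differ depending on build options and quantization.

```shell
#!/usr/bin/env bash
# Hypothetical helper, not part of whisper.cpp.
# Approximate memory use in MB, taken from the figures quoted above.
# "large" is listed as ~3.9 GB; 3900 MB is a rounded stand-in (assumption).

# pick_model BUDGET_MB: print the largest listed model whose approximate
# memory use fits within BUDGET_MB, or return 1 if none fits.
pick_model() {
  local budget_mb="$1"
  local choice=""
  local entry name need_mb
  # Entries are ordered from smallest to largest, so the last match wins.
  for entry in tiny:273 base:388 large:3900; do
    name=${entry%%:*}
    need_mb=${entry##*:}
    if [ "$need_mb" -le "$budget_mb" ]; then
      choice=$name
    fi
  done
  [ -n "$choice" ] || return 1
  printf '%s\n' "$choice"
}

pick_model 1024   # prints: base
```

With a budget below 273 MB the function prints nothing and returns a non-zero status, which lets a calling script fall back or abort.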

Does whisper.cpp support GPU inference?

Yes. whisper.cpp supports accelerated inference on NVIDIA GPUs, through OpenVINO, on Ascend NPUs, and on Moore Threads GPUs.

whisper.cpp

Overview

whisper.cpp is a C/C++ port of OpenAI's Whisper model for high-performance automatic speech recognition (ASR) inference. The implementation is lightweight, which makes it easy to integrate across diverse platforms: the core is contained in whisper.h and whisper.cpp and is built on the ggml machine learning library. Key features include Apple Silicon optimization (ARM NEON, Accelerate framework, Metal, Core ML), AVX and VSX intrinsics support, mixed F16/F32 precision, integer quantization, and zero memory allocations at runtime. It supports CPU-only inference as well as hardware acceleration (NVIDIA GPUs, OpenVINO, Ascend NPU, Moore Threads GPU), and it exposes a C-style API. Supported platforms include Mac OS, iOS, Android, Java, Linux, WebAssembly, Windows, Raspberry Pi, and Docker.

Common tasks

Speech-to-Text Transcription, Voice Activity Detection

FAQ

What platforms are supported by whisper.cpp?

whisper.cpp supports a wide range of platforms, including Mac OS (Intel and Arm), iOS, Android, Java, Linux / FreeBSD, WebAssembly, Windows (MSVC and MinGW), Raspberry Pi, and Docker.

What is GGML?

GGML is the machine learning library that whisper.cpp uses for its implementation. It allows for lightweight integration of the model into various platforms and applications.

How do I quantize a model with whisper.cpp?

You can quantize a model using the quantize tool, passing the input model, the output file, and the quantization type. For example: ./build/bin/quantize models/ggml-base.en.bin models/ggml-base.en-q5_0.bin q5_0
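The command above can be wrapped in a small script so that the output name is derived from the input model and the quantization type. This is only a sketch: the function names quantized_path and quantize_model and the QUANTIZE_BIN variable are hypothetical, and the output naming simply mirrors the example above. It assumes the tool has already been built at ./build/bin/quantize.

```shell
#!/usr/bin/env bash
# Hypothetical wrapper around the quantize tool; names are illustrative only.

# quantized_path MODEL QTYPE: derive the output file name by inserting
# "-QTYPE" before the ".bin" suffix, mirroring the example command.
quantized_path() {
  local model="$1"
  local qtype="$2"
  printf '%s\n' "${model%.bin}-${qtype}.bin"
}

# quantize_model MODEL QTYPE: run the quantize tool if it has been built.
# QUANTIZE_BIN is an assumed override for the tool location.
quantize_model() {
  local model="$1"
  local qtype="$2"
  local tool="${QUANTIZE_BIN:-./build/bin/quantize}"
  local out
  out=$(quantized_path "$model" "$qtype")
  if [ ! -x "$tool" ]; then
    echo "quantize tool not found at $tool; build whisper.cpp first" >&2
    return 1
  fi
  # Argument order follows the example: input, output, quantization type.
  "$tool" "$model" "$out" "$qtype"
}

quantized_path models/ggml-base.en.bin q5_0   # prints: models/ggml-base.en-q5_0.bin
```

Calling quantize_model models/ggml-base.en.bin q5_0 would then run the same command as in the answer above, and it returns a non-zero status with a message if the tool is missing.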

How do I enable Core ML support?

To enable Core ML support, configure the whisper.cpp build with the -DWHISPER_COREML=1 CMake option: cmake -B build -DWHISPER_COREML=1
