Blaed@lemmy.world to

Technology@lemmy.worldEnglish · 1 year ago

Introducing Stable-Diffusion.cpp (Inference in Pure C/C++)

2

3

Introducing Stable-Diffusion.cpp (Inference in Pure C/C++)

Blaed@lemmy.world to

Technology@lemmy.worldEnglish · 1 year ago

2

cross-posted from: https://lemmy.world/post/3549390

stable-diffusion.cpp

Introducing stable-diffusion.cpp, a pure C/C++ inference engine for Stable Diffusion! This is a really awesome implementation to help speed up home inference of diffusion models.

Tailored for developers and AI enthusiasts, this repository offers a high-performance solution for creating and manipulating images using various quantization techniques and accelerated inference.

https://github.com/leejet/stable-diffusion.cpp

Key Features:

Efficient Implementation: Utilizing plain C/C++, it operates seamlessly like llama.cpp and is built on the ggml framework.

Multiple Precision Support: Choose between 16-bit, 32-bit float, and 4-bit to 8-bit integer quantization.

Optimized Performance: Experience memory-efficient CPU inference with AVX, AVX2, and AVX512 support for x86 architectures.

Versatile Modes: From original txt2img to img2img modes and negative prompt handling, customize your processing needs.

Cross-Platform Compatibility: Runs smoothly on Linux, Mac OS, and Windows.

Getting Started

Cloning, building, and running are made simple, and detailed examples are provided for both text-to-image and image-to-image generation. With an array of options for precision and comprehensive usage guidelines, you can easily adapt the code for your specific project requirements.
git clone --recursive https://github.com/leejet/stable-diffusion.cpp
cd stable-diffusion.cpp
If you have already cloned the repository, you can use the following command to update the repository to the latest code.
cd stable-diffusion.cpp
git pull origin master
git submodule update
More Details

Plain C/C++ implementation based on ggml, working in the same way as llama.cpp

16-bit, 32-bit float support

4-bit, 5-bit and 8-bit integer quantization support

Accelerated memory-efficient CPU inference

Only requires ~2.3GB when using txt2img with fp16 precision to generate a 512x512 image

AVX, AVX2 and AVX512 support for x86 architectures

Original txt2img and img2img mode

Negative prompt

stable-diffusion-webui style tokenizer (not all the features, only token weighting for now)

Sampling method

Euler A

Supported platforms

Linux

Mac OS

Windows

This is a really exciting repo. I’ll be honest, I don’t think I am as well versed in what’s going on for diffusion inference - but I do know more efficient and effective methods running those models are always welcome by people frequently using diffusers. Especially for those who need to multi-task and maintain performance headroom.

You must log in or register to comment.

Chat

JackGreenEarth@lemm.ee
link
fedilink
English
arrow-up
0·
1 year ago
Does this run faster than the python model?
- olicvb@lemmy.ca
  link
  fedilink
  English
  arrow-up
  1·
  edit-2
  1 year ago
  Got a 1.42s generation on the Cpp one and 2.1s with auto1111’s SD (note my torch is outdated, model was converted to fp16).
  
  Though i’m having trouble finding the generated image 😅.
  
  All on the same generation settings, 5800x cpu & 3080 12gb

Technology@lemmy.world

technology@lemmy.world

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@lemmy.world

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

6.33K users / day
9.41K users / week
17.4K users / month
34.9K users / 6 months
1 local subscriber
59.3K subscribers
11.3K Posts
484K Comments
Modlog