Tools and Resources for AI Art
Curious about AI art? Start with apps like Midjourney or NightCafe, then explore prompt engineering and tools like Deforum or NMKD to go deeper.

-
Apps
No Code AI Art tools
- Artbreeder - StyleGAN model with "genes" (directions in latent space) for editing
- Artbreeder Collage - CLIP guided diffusion on top of simple collages
- Astria.ai - nice and easy Dreambooth training - upload images, and get finetuned SD model
- BlueWillow - text to image Discord like MidJourney (appears to use Stable Diffusion finetunes)
- CogView - text to image, Chinese model like DALL-E ( interview )
- conjure.art - new text to image site currently in beta
- craiyon - formerly known as dall-e mini, free and makes quick grids of 9 outputs
- Dall-e 2 - OpenAI's text to image
- DeepDreamGenerator - deep style, thin style, deep dream
- DreamStudio - easy to use text to image from creators of Stable Diffusion
- Genmo - short animations (looks like KLMC2) ?
- Kaiber - create short animations (looks like Deforum) ?
- midjourney - text to image via discord bot
- murf.ai - text to speech with Ai voices
- neural.love image-upscale - credit based image upscaling service
- NightCafe - style transfer, VQGAN, diffusion image generation
- Ostagram - style transfer
- Playform - style transfer, train stylegans, images morphs
- pollinations.ai - run lots of popular notebooks
- ProsePainter - interactive tool to "paint with words"
- runwayml - video editing powered by AI ?
- snowpixel - text to image and variations
- StarryAI - text to image with easy selection of styles
- synth.run - text to image app for iOS, Android, and web
- tokkingheads - animate portraits with Ai
- uberduck.ai - text to speech with lots of different voices
- Visions of Chaos - run the popular AI notebooks locally on Windows (see the Machine Learning setup steps)
- Wombo - Super fast and free
- wzrd.ai - give it a music file and produce animation from big set of pretrained GANs
Create Game Assets
- Layer - 2D assets and variations for games
- Luma AI Imagine 3D - alpha test of text to 3D models
- MirageML - 3D assets and prototyping
- Scenario.gg - AI-generated game assets
- withpoly - AI-generated textures and materials
Online Galleries to Showcase Art
- OnCyber art galleries - https://oncyber.io - Cool 3D art gallery to showcase your art with links to NFT market
- Spatial - https://spatial.io
-
AI Art Discord Servers
There are quite a few Discord servers dedicated now to AI artists or discussing text to image techniques.
- Ai NFT Discord - AI NFT Consortium. Has especially useful StyleGAN training resources
- Disco Diffusion Discord - chat and tech support for the Disco notebook
- EleutherAI Discord - researchers and good art room with more technical discussions
- Jukebox Community Discord - server for using OpenAI Jukebox for music generation
- LAION Discord - group working on replicating a full DALLE-E
- NeuralismAI Discord - AI art competitions and knowledge exchange
- Prompt Sharing Discord - community for sharing text to image prompts
- VQGAN+CLIP Discord - home of Instagram #vqganclipcommunitycolab
- Zoetrope Central Spoke Discord - support and discussion of the Looking Glass notebook
-
Learn to Code Generative Ai
- The Illustrated Stable Diffusion - really nice overview of Stable Diffusion and the pieces that make it up
- AIAIART - really nice ongoing youtube series and discussion in its Discord
- Deep Learning for Art, Aesthetics, and Creativity - MIT course available on youtube
- Dive into Deep Learning (online, free, interactive)
- Deep Learning Foundations to Stable Diffusion - 4 videos from the fast.ai class
- Generative Deep Learning: Teaching Machines to Paint, Write, Compose, and Play by David Foster [2019]
Really enjoyed this and it's a great book! It's from 2019 so doesn't cover the very latest like VQGAN, CLIP, guided diffusion though. - HuggingFace Diffusion Models Class - nice coverage of the diffusers library and Stable Diffusion
- The Artist in the Machine: The world of AI-powered creativity by Arthur I. Miller [2020]
Not very technical but engaging and inspiring view of many Ai art projects so far. - ml4a.net - online textbook, classes, and learning resources
-
Music
You can generate music with AI using OpenAI's Jukebox. You can prompt Jukebox with an artist and music genre or with a short audio clip in WAV format. It generates new music for you in phases of increasing quality (level_2, level_1, level_0) and takes about 8 hours on Colab.
- aiva - ai composition of soundtracks and music
- amper - royalty free ai music creation
- AudioLDM - text to audio latent diffusion model (huggingface) (replicate) ? ?
- boomy - let's you create and publish music with Ai but they hold the copyright
- D3Net-MSS - colab for splitting music into separate clips for drums, vocals, etc
- Dance Diffusion - audio diffusion! (guide)
- Dance Diffusion Finetuning - fine tune on your own audio dataset
- Easy One Click Jukebox - this is my favorite currently
- Jukebox Community Build - download this ipynb and put in Colab Notebooks folder on Google Drive to use
- lalal.ai - commercial music to stems service
- Moises.ai - ai audio separation
- mubert - nft friendly music remixed by Ai
- Official OpenAI Jukebox - the official notebook from OpenAI
- riffusion - stable diffusion fine tuned on audio spectrograms! (web)
- Spleeter colab - split music into stems
- Zags Jukebox v3.7 - (youtube tutorial)
-
Prompt Engineering
To get good results with CLIP guided diffusion and VQGAN+CLIP you need to find the right words and phrases that will direct the neural network to the content and style you are looking for.
Image to Text
- Antarctic-Captions by @dzryk
- BLIP image captioning HuggingFace space
- CLIP Interrogator - image to prompt! (huggingface) (lambda) (replicate) ?
- CLIP prefix captioning inference notebook (github)
- LLaVa: Large Language and Vision Assistant - ask vision model to describe image
- personality-clip by @dzryk
- PEZ: Prompts made EZ - prompt from image or long to short prompt (huggingface) (colab)
Prompt Guides
- [2023/08/04] Stable Diffusion XL reference library - great guides for SDXL!!
- [2022/11/29] Stable Diffusion V2 CFG Scale Comparison - nice ref of samplers and cfg scale
- [2022/09/16] krea.ai search stable diffusion prompts and browse by modifiers
- [2022/09/07] libraire.ai search 10 million stable diffusion prompts and images
- [2022/08/28] Prompt Parrot v2.0 by @KyrickYoung (train gpt2 on prompt list then generate with stable-diff)
- [2022/08/24] Lexica stable diffusion prompt search engine
- [2022/08/13] Promptomania by @wszp - cool prompt building tool!
- [2022/08/08] Stable Diffusion Artist Studies by @proximasan @EErratica @KyrickYoung @surrailabs
- [2022/08/08] Stable Diffusion Modifier Studies by @proximasan +
- [2022/07/30] Disco Diffusion Portrait Study by @enviraldesign
- [2022/07/13] Dall-e 2 prompt book by @GuyP
- [2022/03/25] Disco Diffusion Modifiers Study by @KyrickYoung and @sureailabs
- [2022/03/21] DiscoDiffusion Model Comparison Study - by @KaliYuga
- [2022/03/05] Midjourney Artist Dump - spreadsheet or artists and example render
- [2022/02/26] Disco Diffusion 70+ Artist Studies
- A Guide to Writing Prompts for Text to Image - Google Doc guide and advice
- CLIP Retrieval Tool - see what kinds of images match strings for CLIP (wait a long time for it to load)
- CLIP Prompt Engineering for Generative Art - nice long guide by Matthew McAteer
- CLIP + VQGAN keyword comparison by @kingdomakrillic
- Artist Studies by @remi_durant - big collection of results using different artist names
- Art Movements and Styles as perceived by VQGAN + Clip (Imagenet 16k, ViT-B/32)
- Art Movements and Styles as perceived by VQGAN + Clip (Imagenet 16k, ViT-B/16)
- Art Movements and Styles as perceived by VQGAN + Clip (Imagenet 16k, RN50x16)
- Art Movements and Styles as perceived by VQGAN + CLIP (Imagenet 16k, RN50x4)
-
Text
- GPT4All Chat - run local windows/linux/mac app like ChatGPT
- oobabooga text-generation-webui - it's like auto1111 sd web ui but for text models
- StableLM space - huggingface space for language model from Stability AI
- Goose.ai Playground - can use their playground to generate text with GPT-Neo
- GPT Neo Colab notebook - use GPT-neo 1.3B and 2.7B from Google colab
- GPT Neo HuggingFace - run GPT-neo 2.7B on HuggingFace
- Neuralism Generative Art Prompt Generator - generate prompts to use for text to image
- OpenAI GPT3 Playground - generate text with GPT-3 (requires free account)
- Textsynth Playground - text completion using large language models
- Dalle-2 Prompt Generator - nice site that let's you generate interesting text prompts
- Prompt Parrot by @KyrickYoung - train GPT2 on a list of your prompts
- MadLib Prompt Generator - makes interesting prompts for you, by @remi_durant
- Noodle Soup Prompts v2.1 by WASasquatch
- Neuralism Prompt Generator - generative art prompt generator
-
Text to Image
There are a TON of shared Google Colab notebooks floating around for doing text to image with pre-trained GAN and diffusion models. I've been compiling the ones I come across and try out and find interesting.
- Stable Diffusion WebUI by automatic1111 - run SD local with lots of features and extensions
- StableStudio - local webui using Stability API for inference
- Deforum Stable Diffusion 0.7 - group effort for ultimate SD notebook (discord) (youtube tutorial) (guide)
- Disco Diffusion v5.6 by Somnai, gandamu, zippy721 (guide) (new guide) (youtube tutorial)
- Huemin Jax Diffusion 2.7 by nshepperd, huemin_art (guide) (stitching guide)
- pytti-tools v0.10 by DigThatData and sportsracer
- VQGAN+CLIP by remi_durant
- [2023/09/15] llamas.ipynb by @osanseviero QR ControlNet + SD1.5 for optical illusions ( tweet )
- [2023/04/28] DeepFloyd IF (huggingface) (github)
- [2023/04/05] Kandinsky 2.1 Batching+Dynamic prompting Colab by @jrobocat
- [2023/04/03] Kandinsky 2.1 (huggingface) (site)
- [2023/03/23] Image-to-text-to-image Colab by @jrobocat - batch CLIP Interrogator + SD generations
- [2023/03/20] ModelScope text-to-video Colab by @camenduru (youtube) (github)
- [2023/03/18] ModelScope text-to-video huggingface space
- [2023/03/14] Unidiffuser - unified diffusion framework (github)
- [2023/02/20] Stable Diffusion Auto Stitching by @oleg_ai_art (guide)
- [2023/02/15] ControlNet - control Stable Diffusion with extra conditioning (youtube) (huggingface) (github) (models)
- [2023/02/14] Pix2Pix video with coherence by @johnowhitaker - stylize video inputs!
- [2023/01/30] Tune-a-Video - create short text2video sequences (github) (paper)
- [2023/01/21] KLMC2 Animation - @DigThatData's fork with lots of additions
- [2023/01/20] InstructPix2Pix - use text instructions to modify images (huggingface)
- [2023/01/19] Image Mixer by @Buntworthy - mix up to 5 images together with SD
- [2023/01/14] Latent Blending by @j_stelzer - smooth transition between SD latents (github)
- [2023/01/10] Custom Diffusion - fast SD finetune with multiple concepts (github)
- [2022/12/22] Karlo - unCLIP architecture like DALLE-2 (huggingface) (github)
- [2022/12/08] Stable Diffusion KLMC2 Animation by @RiversHaveWings
-
[2022/11/30] BAOAB-limit sampler - new SD sampler that can also make anims hella fast (paper)
- [2022/11/25] Stable Diffusion 2.0 Web UI - by @anzorq (run SD 2.0 in colab using Diffusers)
- [2022/11/24] Stable Diffusion 2.0 w Diffusers - by @amrrs (youtube)
- [2022/11/08] Midjourney v4 Style - (dreambooth SD finetune on midjourney v4 outputs)
- [2022/11/03] All-in-one Private Diffusions Colab - fork and upgrades to WD notebook (website)
- [2022/10/25] Fast Dreambooth by TheLastBen (easy fast finetune of stable diffusion in colab)
- [2022/10/08] Stable Worlds by @NaxAlpha (create panoramas with SD!)
- [2022/09/29] MathRockDiffusion by ethansmith2000 (mods and improvements on Disco) ( guide )( cuts )
- [2022/09/29] robo_diffusion_v1 by @nousr (a DreamBooth fine tune of stable diffusion)
- [2022/09/27] Video Killed The Radio Star Diffusion by @DigThatData (transform music videos from YouTube)
- [2022/09/25] fast-stable-diffusion - automatic111 ui, hlky ui, github (+25% speed and low VRAM)
- [2022/09/18] Doohickey Diffusion by aicrumb (stable diffusion with CLIP guidance, perlin init, lots more)
- [2022/09/18] optimized colab by neonsecret (stable diffusion with nice gradio gui in colab)
- [2022/09/13] Stable Diffusion Batch by visoutre (includes tiled upscaling!) (tutorial)
- [2022/09/11] Easy Diffusion by WASasquatch and NOP (stable diffusion with lots of still image features)
- [2022/09/07] NMKD Stable Diffusion GUI (nice easy Windows GUI for stable by Noomkrad)
- [2022/08/30] Simple Stable Diffusion by @ai_curio (supports prompt weighting)
- [2022/08/29] Stable Diffusion WebUi by @altryne (fancy Gradio UI for stable diffusion)
- [2022/08/28] Prompt Parrot v2.0 by @KyrickYoung (train gpt2 on prompt list then generate with stable-diff)
- [2022/08/23] Stable Diffusion Interpolation by @ygantigravity (animate from own prompt to another!)
- [2022/08/23] Deforum Stable Diffusion (discord link) ?
- [2022/08/23] FunkyHorses Stable Diffusion by Coskaiy/Corran (has neat import from spreadsheet)
- [2022/08/23] NOP's Stable Diffusion Colab v0.19 by NOP#1337
- [2022/08/23] Stable Diffusion Lite by @future__art (prompt queueing and seed mining)
- [2022/08/23] Interactive notebook for Stable Diffusion
- [2022/08/22] Stable Diffusion HuggingFace space by stabilityai
- [2022/08/22] Stable Diffusion notebook (easy to use and batch to gdrive) (tutorial)
- [2022/08/22] Official Stable Diffusion notebook - requires hugging face account
- [2022/08/22] DiscoStream v1.1 by @WASasquatch
- [2022/08/20] Disco Diffusion v5.6 with Inpainting by @cut_pow
- [2022/08/18] DiscoArt [w/ Batch Prompts + GPT3 generator] by Skquark
- [2022/08/16] WAS's Disco Diffusion v5.6-9 Portrait Generator Playground by WASasquatch
- [2022/08/08] Paint Pour Diffusion by @EclecticBeams (diffusion trained on paint pour art)
- [2022/07/31] Huemin Jax Diffusion 2.7 August 2022 by @huemin_art
- [2022/07/30] CLIP Prior + VQGAN by @RiversHaveWings and @jd_pressman (a new VQGAN notebook ?)
- [2022/07/23] Textile Diffusion by @KaliYuga (diffusion trained on textiles)
- [2022/07/21] Floral Diffusion by @jags111 (fine tunes for floral)
- [2022/07/18] Liminal Diffusion v1 by @BrainArtLabs (diffusion trained on liminal photographs)
- [2022/07/18] DifNESfusion 1.35 by @LufiQ (fork or PixelArtDiffusion with NES dataset)
- [2022/07/18] Medieval Diffusion by @KaliYuga (diffusion trained on medieval art)
- [2022/07/17] FeiArt_Handpainted CG Diffusion by @FeiArt_AiArt
- [2022/07/17] Fantasy Diffusion by @LaVista (diffusion trained on fantasy art)
- [2022/07/15] Ukiyo-e Portrait Diffusion by @avantcontra
- [2022/07/15] Lithography Diffusion by @KaliYuga (diffusion trained on lithographic landscapes and portraits)
- [2022/07/06] Disco v5.2 Dynamic Prompting (dynamic prompt variations - tutorial video )
- [2022/07/06] Watercolor Diffusion by @KaliYuga (diffusion trained on watercolor paintings)
- [2022/07/05] EnzymeZoo edits to Huemin Jax Diffusion by @EnzymeZoo (brought over masking from Majesty)
- see older notebooks in the archive
-
Upscaling / Super-resolution
- Gigapixel AI by Topaz Labs (costs $99) <- voted #1
- Real-ESRGAN - ( github ) <- voted #2
- Real-ESRGAN Sber - a nice fine tuned ESRGAN model
- chaiNNer - node base tool that can batch process ESRGAN upscale and more
- Cupscale - Windows GUI for ESRGAN
- Latent-SR - Nightmare Ai latent diffusion super resolution (slow but nice!)
- PASD image super resolution - (github) pixel aware Stable Diffusion
- Neural Love - credit based system for diffusion upscaling
- Stable Diffusion Upscaler - latest and greatest ?
- SuperRes Diffusion - Batch upscaling and super resolution with latent-diffusion
- SwinIR - Hugging Face space
- Upscale Model Database - big set of pretrained models for upscaling different types of content
- Waifu2x (github) - designed for anime / manga
- WaifuXL - newer and beats Waifu2x in quality
- LetsEnhance.io - credit based web service for image super resolution
-
Video
Image to video
- [2023/11/22] Stable Video Colab by @mkshing
Text to video
- camenduru text-to-video Colabs - great collection of Zeroscope, potat1, modelscope notebooks
- AnimateDiff (colab) (github) - short video clubs with your own LoRA
- ModelScope (colab) (huggingface) - super fun but prominant shutterstock watermarks
- Text2Video-zero (colab) (github) (huggingface) (webui ext) - zero shot video from Stable Diffusion
Interpolation
- Video Enhance AI by Topaz Labs - commercial upscaling and frame interpolation <- excellent
- AnimationKit AI - video upscaling and interpolation tool <- great
- FILM colab - by @KyrickYoung has pause, loops, reverse <- my fave FILM
- 3D Ken Burns Effect from single image - animated video from 2D image
- 3D Photo Inpainting - cool 3D effects for 2D images
- Animating Pictures with Eulerian Motion Fields - code not out yet, looks like it'll be awesome
- DAIN colab - depth aware interpolation
- EbSynth - stylize video by giving it ai or hand painted key frames from video
- ESRGAN 4 Video - increase resolution of video with ESRGAN
- FILM: Frame Interpolation for Large Motion - (replicate link) smooth interpolation/morphing
- Flowframes - free Windows tool with patreon option, uses RIFE and other models
- PyTTI-Tools: FILM - @DigThatData 's version of FILM for video frames
- RIFE - smooth interpolation of video to increase frame rate
- Sequence Frame Interpolation - batch version of FILM
- Super Slomo - another way to increase frame rate of video
- Video Art and Styling Tools - by @Coskaiy (style transfer, interpolation, superres, and more)
Animation
- [2022/11/03] FrameSync.xyz - Automate Deforum Keyframe animations with waveforms
- [2022/10/26] Tulpa Prompter by @dreamingtulpa - helper to build animation prompts (tweet)
- [2022/08/15] AnimationPreview by @pharmapsychotic - quickly preview Deforum camera animations
- [2022/08/04] DALL-E 2.5D Depth Warped Zoom by @deKxi
- [2022/03/31] PyDub Audio to Disco Diffusion Keyframe Generator v0.1 by austinhquinn
- [2022/02/26] Wiggle animation key frame generator by @zippy731
- [2022/02/23] audio-reactive-video - by @vsewall2motion, skip video frames based on volume
- Keyframe string generator for AI animation notebooks
- Audio to keyframe string generator for AI animation notebooks
What's Your Reaction?






