Все модели
Полный список отсортирован по популярности на Replicate.
Surrealist digital art featuring whimsical, anthropomorphic characters with exaggerated textures and vibrant color blocking
Separate instruments and/or vocals from any song.
🔊 Text-Prompted Generative Audio Model Topics Resources
Flux fine-tuned to write and draw in condensation
Flux lora, use "Y2K" to trigger image generation
One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
URPM V1.3 Model (Text2Img, Img2Img and Inpainting)
Step-Audio-TTS-3B represents the industry's first Text-to-Speech (TTS) model trained on a large-scale synthetic dataset utilizing the LLM-Chat paradigm
A flux fine-tune based on neo-impressionism
Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs
Your Daily AI Partner
Upscale pictures and videos using the nunif repo (formerly waifu2x).
explore img2img zooming sdxl
🫦 Realistic facial expression manipulation (lip-syncing) using audio or video
Generate an image using text by visualizing CLIP features.
Title updated from TypeScript code!
Text to image prompt
SDXL model trained on the cult movie AKIRA
Model that generates Cartoon like characters
Qwen-VL-Chat but with raw ChatML prompt interface and streaming
Idefics2 is an open multimodal model that accepts arbitrary sequences of image and text inputs and produces text outputs
A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
An implementation of ByteDance/SDXL-Lightning-8step
Precise AI-powered product cutout with 256-level transparency for eCommerce
A 70 billion parameter Llama tuned for coding with Python
RESEARCH/NON-COMMERCIAL USE ONLY: diffusion-based audio-driven expressive talking head generation
Clean videos by automatically removing text overlays
Tuning-free framework to achieve high appearance and temporal consistency in video editing
SDXL fine-tuned on Santa Hats
camcorgi generates photos that feature the cam corgi known as @corgi.cam
Generate 3D assets using text descriptions
Text-conditional image generation model based on OpenAI's unCLIP
Amphion Singing Voice Conversion: DiffWaveNetSVC
Conceptual image-to-image model for Stable Diffusion 1.5
SVFR: A Unified Framework for Generalized Video Face Restoration
create cinematic natural neon film look
Animate any character, humans, cartoons, animals, even non-humans, from a single image + driving video