All models
The full list is sorted by popularity on Replicate.
Uses pixray with raw settings.
Detects whether a picture contains an anime face.
Use ideogram-character to face-swap someone into a target image
lmsys/vicuna-7b-v1.3
Generate video
Stable Diffusion 2.0 Preview
Wan 2.2 First and Last Frame using 8-step inference w/ Lightning LoRA
Kokoro is a frontier TTS model for its size of 82 million parameters (text in/audio out).
Whisper transcription plus speaker diarization
Detects any class given class names
Run any ComfyUI workflow on an A100. Guide: https://github.com/replicate/cog-comfyui
Detect AI Generated Text with Fast-DetectGPT
Edit an image with a prompt. This is the hidream-e1.1 model accelerated with the pruna optimisation engine.
Create your own variants of "this is fine" 🔥☕️🐕
Ollama Qwen2.5 72b
Experimental backgrounds generator based on FLUX.
End-to-end AI speech model designed for natural-sounding conversational speech synthesis, with support for context-aware prosody, intonation, and emotional expression.
Generate a new image given any input text with Babes 2.0
Stable Audio Open is an open-source model optimized for generating short audio samples, sound effects, and production elements using text prompts.
Instance-Conditioned GAN
Photorealism with RealVisXL V4.0 Lightning
Ultralytics YOLO11n object detection model with 2.6M parameters. Achieves 39.5 mAP50-95 on the COCO dataset. Optimized for real-time inference with 1.55 ms speed on a T4 GPU.
Anima Pencil XL v5 Model (Text2Img, Img2Img and Inpainting)
The Yi series models are large language models trained from scratch by developers at 01.AI.
Split a video into frames
Reve's fast image edit model at only $0.01 per edit
The "Overall Best Performing Open Source 7B Model" for Coding + Generalization or Mathematical Reasoning
Use Wan 2.2 Animate to replace a character in a video scene
Image Style Transfer with Text Condition
Whisper AI with channel separation and speaker diarization
Video toolkit – convert, make GIFs, extract audio
Take photos with a disposable camera. Like this? Use this with yourself in it on my app PhotoAI.com
TripoSR: Fast 3D Object Reconstruction from a Single Image
Fast automatic speech recognition (70x realtime with large-v2) with word-level timestamps and speaker diarization.
Activating More Pixels in Image Super-Resolution Transformer
Realistic vision + inpainting + controlnet pose
Zero-shot / open vocabulary object detection
pixray text2image (future branch)
High resolution image Upscaler and Enhancer. Twitter/X: @philz1337x
🗣️ Nvidia + Suno.ai's speech-to-text conversion with high accuracy and efficiency 📝
Virtual dressing room
Kolors Model (Text2Img and Img2Img)
Portraits with stable-diffusion
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching. Voice cloning
Works with inpainting and multi-controlnet or single-controlnet, with or without ip-adapter
Introducing a LoRA instant training model for crafting stunning 1024x1024 visuals. Train your own LoRA model from a zip of photos for instant outputs. Try the LoRA model using this link: https://replicate.com/zylim0702/sdxl-lora-customize-model.
Removes furniture
Video Smoother: AMT All-Pairs Multi-Field Transforms for Efficient Frame Interpolation
Model for Sound demixing challenge 2023: Music Demixing Track - MDX'23