All models

Step-Audio-TTS-3B represents the industry's first Text-to-Speech (TTS) model trained on a large-scale synthetic dataset utilizing the LLM-Chat paradigm

1,143 runs

fofr/flux-neo-impressionism

A flux fine-tune based on neo-impressionism

1,138 runs

huzaifaqadeer/paintedflux

1,137 runs

chenxwh/deepseek-vl2

Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

1,136 runs

lucataco/csm-1b

CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs

1,133 runs

babysea/avatar

Your Daily AI Partner

1,132 runs

do-not-follow-me/chxdkuz

1,132 runs

shreejalmaharjan-27/waifu2x

Upscale pictures and videos using the nunif repo (formerly waifu2x).

1,129 runs

anotherjesse/sdxl-recur

Explore img2img zooming with SDXL

1,128 runs

zsxkib/v-express

🫦 Realistic facial expression manipulation (lip-syncing) using audio or video

1,128 runs

laion-ai/deep-image-diffusion-prior

Generate an image using text by visualizing CLIP features.

1,127 runs

usamaehsan/voices

1,127 runs

zeke/hello-world

Title updated from TypeScript code!

1,126 runs

qr2ai/ar

Text to image prompt

1,122 runs

doriandarko/sdxl-akira

SDXL model trained on the cult movie AKIRA

1,116 runs

lkincel/tinytales

Model that generates cartoon-like characters

1,113 runs

aisha-ai-official/boobai-v2.0-alpha

1,112 runs

nomagick/qwen-vl-chat

Qwen-VL-Chat but with raw ChatML prompt interface and streaming

1,105 runs

lucataco/idefics-8b

Idefics2 is an open multimodal model that accepts arbitrary sequences of image and text inputs and produces text outputs

1,104 runs

tencent/hunyuandit-v1.1

A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

1,104 runs

re-mix-1/sdxl-lightning-8step

An implementation of ByteDance/SDXL-Lightning-8step

1,101 runs

bria/product-cutout

Precise AI-powered product cutout with 256-level transparency for eCommerce

1,098 runs

pieceof/comfyui-docker

1,098 runs

meta/codellama-70b-python

A 70 billion parameter Llama tuned for coding with Python

1,096 runs

cjwbw/dreamtalk

RESEARCH/NON-COMMERCIAL USE ONLY: diffusion-based audio-driven expressive talking head generation

1,093 runs

hjunior29/video-text-remover

Clean videos by automatically removing text overlays

1,093 runs

tiger-ai-lab/anyv2v

Tuning-free framework to achieve high appearance and temporal consistency in video editing

1,093 runs

fofr/sdxl-santa-hat

SDXL fine-tuned on Santa Hats

1,092 runs

hilongjw/fast-sam

1,092 runs

yosun/camcorgi

Generates photos featuring the cam corgi known as @corgi.cam

1,088 runs

adirik/mvdream

Generate 3D assets using text descriptions

1,086 runs

cjwbw/karlo

Text-conditional image generation model based on OpenAI's unCLIP

1,086 runs

lucataco/singing_voice_conversion

Amphion Singing Voice Conversion: DiffWaveNetSVC

1,086 runs

falta-studio/tensor

1,083 runs

settyan/flash-v2.0.2-beta.10

1,083 runs

vivalapanda/conceptual-image-to-image-1.5

Conceptual image-to-image model for Stable Diffusion 1.5

1,083 runs

yodagg/sam3-image-seg

1,078 runs

zsxkib/stable-video-face-restoration

SVFR: A Unified Framework for Generalized Video Face Restoration

1,077 runs

buildingwithai/ai-jo

1,075 runs

torrikabe/cinematic_neon

Create a cinematic, natural neon film look

1,073 runs

bytedance/dreamactor-m2.0

Animate any character (humans, cartoons, animals, even non-humans) from a single image and a driving video

1,069 runs
