Zero Cost GPU ๐ค Spaces - ๐Zero๐งโ๐ผHot๐ฅ
Generate realistic dialogue from a script, using Dia!
|
nari-labs
๐ฏ
881
|
InstantX
๐ข
307
Generate customized images using text and multiple images
|
bytedance-research
โก๏ธ
630
|
nvidia
โก
139
Edit an image based on the given instruction.
|
stepfun-ai
๐ป
134
|
lisonallen
๐ฌ
139
Text-to-3D and Image-to-3D Generation
|
tencent
๐
2424
New Ghibli EasyControl model is now released!!
|
jamesliu1217
๐ฆ
1365
|
retwpay
๐ผ
110
|
lllyasviel
๐
3416
|
black-forest-labs
๐ฅ๏ธ
8161
|
multimodalart
๐
1491
Ultra fast high quality image generation
|
Efficient-Large-Model
๐
361
|
not-lain
๐w๐
1675
|
Yuanshi
๐จ
313
|
black-forest-labs
๐๏ธ๐จ
4703
Scalable and Versatile 3D Generation from images
|
theseanlavery
๐ข
95
|
VAST-AI
๐
57
Upgraded to v1.0!
|
hexgrad
โค๏ธ
2529
|
VAST-AI
๐ฎ
697
Controllable Zero-Shot Voice Imitation
|
amphion
๐
54
Chat with Microsoft's 1.58bit Bitnet model!
|
suayptalha
๐พ
56
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
|
mrfakename
๐ฃ๏ธ
2233
|
CompVis
๐ฅ
40
|
Shakker-Labs
๐ฅ
66
Easily expand image boundaries
|
fffiloni
๐
2039
|
Zeyue7
๐
54
Dia - 1.6B Text-to-Dialogue Model
|
mrfakename
๐
33
|
tonyassi
๐ฃ๏ธ
1973
255+ Impressive LoRA's For Flux.1
|
prithivMLmods
๐ฅณ
846
Clarity AI Upscaler Reproduction
|
finegrain
๐ผ๏ธ๐ช
1446
LiveCC-7B-Instruct
|
chenjoya
๐
30
Apply the motion of a video on a portrait
|
KwaiVGI
๐คช
3357
|
fancyfeast
๐
1242
High-fidelity 3D Geometry Generation from images
|
Stable-X
๐ข
565
Flexible Photo Recrafting While Preserving Your Identity
|
ByteDance
๐ธ
910
morpheus tts - uncensored
|
MrDragonFox
๐
24
MidJour | A RealVisXL_Turbo | IRL HI-Res Images Gen
|
mukaist
๐๏ธ
2863
FLUX.1 RealismLora
|
DamarJati
๐
1245
|
jasperai
๐
1344
|
MohamedRashad
๐
37
|
multimodalart
๐บ
1856
Generate stunning high quality illusion artwork
|
AP123
๐
5139
|
lllyasviel
๐
1169
Fast Images-to-3D Generation within 1 Second
|
tencent
๐ฅ
110
Text-to-terrain model (reflectance and elevation)
|
mikonvergence
๐
25
|
gokaygokay
๐
735
|
hkchengrex
๐
674
Blazingly Fast and Embarrassingly Simple Song Generation
|
ASLP-lab
๐ถ
563
Conversational speech generation
|
sesame
๐ฑ
764
AI Clothes Changer Online
|
jallenjia
๐
195
Precise Background Preservation in Editing
|
xilluill
๐จ
35
SD3.5 in 8-steps with TensorArt TurboX
|
VIDraft
๐
34
MultiImages-to-3D Generation
|
tencent
๐
252
|
InstantX
๐ป
3335
|
VAST-AI
๐ฎ
73
Detect and Identify your Images
|
DawnC
๐ฐ๏ธ
11
Animate Your Pictures With Stable VIdeo DIffusion
|
seawolf2357
โจ๐ฅ
53
|
gokaygokay
๐ป
1241
High quality Images in Realtime
|
KingNish
โก
707
|
Yuanshi
๐
871
|
lllyasviel
๐
724
|
multimodalart
๐๐ผ๏ธ๐
572
Fast image relighting using Latent Bridge Matching
|
jasperai
โจ
287
plug-and-play with visual concepts
|
IP-composer
๐จ
25
|
VIDraft
๐
49
interior design AI - thanks @broyang
|
ginigen
๐
11
|
TheStinger
โก
262
|
TheStinger
๐
142
|
guardiancc
๐
133
remove background from any image
|
briaai
๐ข
667
thought that reasoning into LLMs without modification
|
VIDraft
๐จ
17
ocr images and video understand
|
prithivMLmods
๐
209
|
Plachta
๐ค๐
303
|
hf-audio
๐คฏ
883
Spanish finetune for the original F5 model.
|
jpgallegoar
๐ฃ๏ธ
490
OmniParser, turn your LLM into GUI agent
|
microsoft
๐ข
414
by ZONOS model
|
ginigen
๐
80
|
fancyfeast
๐
15
Reasoning + Multimodal + VLM + Deep Research + Agent
|
VIDraft
๐ฅ
11
Blind vote on HF TTS models!
|
Pendrokar
๐ค๐
356
|
depth-anything
๐
444
|
ameerazam08
๐
199
Scalable and Versatile 3D Generation from images
|
innoai
๐ข
21
|
ostris
๐ฅ๏ธ
66
The Ultimate Anime-themed SDXL model
|
Asahina2K
๐
275
Large Animatable Human Model
|
3DAIGC
โก
325
Try Orpheus TTS here
|
MohamedRashad
๐
180
Large Avatar Model for One-shot Animatable Gaussian Head
|
3DAIGC
โก
77
multimodal,search,image capabilities on par with GPT-4o
|
VIDraft
๐๐ช
15
Create a 3D model from an image in 10 seconds!
|
TencentARC
๐
1429
|
TencentARC
๐ทโ๏ธ
1140
Fast Text 2 Video Generator
|
SahaniJi
โก
23
|
fancyfeast
๐ฌ
1173
flux.1 dev + super realism lora
|
prithivMLmods
๐ฅ
160
|
fantaxy
๐
157
|
ByteDance
๐ผ
749
VGGT (CVPR 2025)
|
๐
161
|
huggingface-projects
๐ฅ
138
|
IndexTeam
๐
52
|
ByteDance
๐
54
Reasoning + Multimodal + VLM + Deep Research + Agent
|
VIDraft
๐ฅ
11
|
nvidia
๐ฌ
6
|
aiqtech
๐ค
7
|
acvlab
๐ป
6
|
hf-audio
๐คซ
653
Creative Upscaler High-Res Image Generation HiDiffusion SDXL
|
radames
๐๐ต๏ธ
397
Create a 1M faces 3D colored model from an image!
|
Wuvin
โก
756
|
Potre1qw
๐ฅ
9
|
yanze
๐ค
1925
a super consistent video depth model
|
tencent
๐ฆ
165
Generate multi-view images from a single image
|
VAST-AI
๐
179
|
andyaii
๐คฆ
11
|
aiqtech
๐ผ
49
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
|
srinivasbilla
๐ฅ
293
A unified multimodal understanding and generation model.
|
deepseek-ai
๐
1953
based by 'Lumina Image 2.0' with Multilingual
|
ginigen
๐ผ
2
|
Steveeeeeeen
๐
372
|
armen425221356
๐
8
A demo of HVI-CIDNet
|
ginigen
๐
11
Structure-Preserving Style Transfer with Canny, Depth & Flux
|
fotographerai
๐ฅ
84
|
moonshotai
๐๐๐ป๐ฌ
41
|
dangtr0408
๐ฆ
10
Create images with Flex.2-preview!
|
suayptalha
๐
5
|
artificialguybr
๐
319
High-quality virtual try-on ~ Your cyber fitting room
|
levihsu
๐ฅผ๐๐
1049
|
ZhengPeng7
๐
221
|
ehristoforu
๐ฅ
1717
Restore blurred or small images with prompt
|
Fabrice-TIERCELIN
๐ท
148
|
thinhlpg
๐
79
Advanced Image Generator
|
KingNish
๐
301
|
lenML
๐ฌ
276
|
tori29umai
๐
248
|
gokaygokay
โก
183
|
lj1995
๐ค
143
|
fantos
๐๏ธ
3
input text, a video from the past to the future
|
ginipick
๐ช
4
|
Phips
๐ฅ
114
Remove/Change background of video.
|
innova-ai
๐ฝ๏ธ
420
Generate images with SD3.5
|
stabilityai
๐
1867
Consistency generation of portrait and subject
|
scepter-studio
๐ช
94
AccuVision Diffusion generated IMAGES
|
ginipick
๐
2
Audio Conditioned LipSync with Latent Diffusion Models
|
Potre1qw
๐
20
|
LittleFrog
๐ข
182
Elegant Janus Multimodal & T2I Demo
|
ginigen
๐
3
Create a 1M faces 3D colored model from an image!
|
hysts-duplicates
โก
6
3D Generation from text prompts
|
cavargas10
๐
6
Portrait Animation
|
VIDraft
โก
36
Ovis2-16B
|
ginigen
๐ฆซ
1
FLUX Hand-written STYLE Genereator
|
ginigen
๐ผ
2
Text2Visual Web Converter with AI Image Generation
|
VIDraft
๐ฅ
5
SD3.5 in 8-steps with TensorArt TurboX
|
multimodalart
๐
123
Huggingface demo of EasyControl
|
jamesliu1217
๐ฆ
69
GitHub Research agent to help you find the best GitHub repos
|
zamal
โก
11
|
TencentARC
๐ฆ
59
Generate image from text prompts
|
ovi054
๐
5
Elevating Ghibli-style AI art beyond ChatGPT's capabilities.
|
seawolf2357
๐ฆ
59
Mapping the world of LLMs using biology inspired tools.
|
nyax
๐ฅ
5
|
abidlabs
๐ฏ
4
|
WensongSong
๐
4
Video Dubbing with Open Source Projects
|
r3gm
๐
743
|
TencentARC
๐ท
1904
Easily remove your videos background!
|
amirgame197
๐๏ธ
301
Ultra-fast Whisper Turbo inference โก
|
mrfakename
โก
38
|
ByteDance
โก
380
|
FoivosPar
๐ฅ
161
Solo Piano Audio to MIDI Transcription
|
asigalov61
๐ฆ
33
Style-Preserving Text-to-Image Generation
|
InstantX
๐
432
Stable Diffusion Finetuned Version
|
Nick088
๐๐จ๐จโ๐ผ
146
|
philipp-zettl
๐
166
|
lllyasviel
๐ป
744
Media understanding
|
lixin4ever
๐ฅ๐ธ๐ฌ
145
Ikea could never
|
broyang
๐
38
Easily remove your videos background!
|
fantaxy
๐๏ธ
78
|
aiqtech
๐ผ
102
|
Etrwy
๐ฅ
4
Apply the motion of a video on a portrait
|
Han-123
๐คช
80
|
fantaxy
๐ค
66
Generate stunning high quality illusion artwork
|
andyaii
๐
20
Image generator/identifier/reposer
|
Shitao
๐ผ
679
Framer: Interactive Frame Interpolation
|
wwen1997
๐
355
|
black-forest-labs
๐๏ธ
264
[ 200+ Impressive LoRA For Flux ]
|
Smiley0707
๐ฅณ
10
Generate characters complete with world, backstory, and imgs
|
Nymbo
โจ๐๐
57
|
xiaozaa
๐ฅ๏ธ
32
|
azhan77168
๐ชถ
14
Would you like to see your character in 360ยฐ?
|
aki-0421
๐
10
โจ[With v1.0.0] Accelerated TTS on Kokoro-82M
|
Remsky
๐ด
266
Demo of GOT-OCR 2.0's Transformers implementation
|
yonigozlan
๐ท๐
74
AI-Powered Research Impact Predictor v2
|
VIDraft
๐
12
|
openfree
๐ผ
11
|
Badger123t
๐คฆ
14
animagine-xl-4.0 Multilingual
|
ginigen
๐
7
|
VIDraft
๐ผ
7
|
VIDraft
๐ผ
6
Scalable and Versatile 3D Generation from images
|
cavargas10
๐
7
Webtoon images generate and add text to image
|
openfree
๐ข
9
Source-code Include
|
ginigen
๐คช
18
based by 'Mixture Of Diffusers SDXL Tiling'
|
fantos
๐
68
FLUX Hand-written STYLE Genereator
|
ginigen
๐ผ
9
FLUX Hand-written STYLE Genereator
|
ginigen
๐ผ
6
|
retwpay
๐ผ
7
A highly visual AI chatbot that transforms every response in
|
VIDraft
๐ฅ
1
Video gen using SkyReels model from HunyuanVideo.
|
1inkusFace
๐
8
|
ginigen
๐
65
Fast Images-to-3D Generation within 1 Second
|
mubarak-alketbi
๐ฅ
8
generated sound from video/text and search. Thanks @MMAUDIO
|
ginigen
๐๐
10
|
VIDraft
๐
7
|
VIDraft
๐
18
Private-BitSix-Mistral-Small-3.1-24B-Instruct-2503
|
ginigen
๐ถ
12
AI web app that transforms photos into Ghibli-style artwork
|
ginigen
๐ผ
41
Create avatars and profile images, turning your memes
|
ginigen
๐ผ
10
multimodal,search,image capabilities on par with GPT-4o
|
ginigen
๐๐
6
multimodal,search,image capabilities on par with GPT-4o
|
Heartsync
๐๐
6
multimodal,search,image capabilities on par with GPT-4o
|
Heartsync
๐๐
5
multimodal,search,image capabilities on par with GPT-4o
|
Heartsync
๐๐
5
multimodal,search,image capabilities on par with GPT-4o
|
Heartsync
๐๐
4
multimodal,search,image capabilities on par with GPT-4o
|
Heartsync
๐๐
6
multimodal,search,image capabilities on par with GPT-4o
|
Heartsync
๐๐
6
multimodal,search,image capabilities on par with GPT-4o
|
Heartsync
๐๐
4
multimodal,search,image capabilities on par with GPT-4o
|
Heartsync
๐๐
4
multimodal,search,image capabilities on par with GPT-4o
|
Heartsync
๐๐
4
multimodal,search,image capabilities on par with GPT-4o
|
Heartsync
๐๐
4
multimodal,search,image capabilities on par with GPT-4o
|
Heartsync
๐๐
4
multimodal,search,image capabilities on par with GPT-4o
|
Heartsync
๐๐
4
multimodal,search,image capabilities on par with GPT-4o
|
Heartsync
๐๐
4
multimodal,search,image capabilities on par with GPT-4o
|
Heartsync
๐๐
4
multimodal,search,image capabilities on par with GPT-4o
|
Heartsync
๐๐
4
multimodal,search,image capabilities on par with GPT-4o
|
Heartsync
๐๐
4
Try Orpheus TTS here
|
Karayakar
๐
8
Official demo for the COP-GEN-Beta model
|
mikonvergence
๐
6
|
VAST-AI
โก
5
|
AngelBottomless
๐ผ
8
|
Fizzarolli
๐ฏ
3
|
huggingface-projects
โ
183
text-to-3D & image-to-3D
|
hysts
๐งข
537
|
skytnt
๐ผ๐ถ
506
|
r3gm
๐
214
|
artificialguybr
๐
159
|
Doubiiu
๐จ
282
|
prs-eth
๐ต๏ธ
397
|
multimodalart
๐ง๐ฟ๐ง๐ฝโ๐ฆฑ
939
Get a music sample inspired by the mood of an image
|
fffiloni
๐บ
504
Edit audios with text prompts
|
hilamanor
๐ง
285
|
Zhengyi
๐
289
|
OpenGVLab
๐
96
The most opinionated, anime-themed SDXL model
|
Asahina2K
๐
1333
High-fidelity Virtual Try-on
|
yisol
๐๐๐
1892
AI filter for your portraits
|
multimodalart
๐จโ๐ค
822
|
LAOS-Y
๐
18
|
nroggendorff
๐
8
Text to Audio (Sound SFX) Generator
|
declare-lab
๐
305
Fast Text 2 Video Generator
|
KingNish
โก
618
High quality image generation in 3 second
|
KingNish
โก
303
Chat with Cognitive Computation models ๐ฌ
|
cognitivecomputations
๐ฌ
126
|
John6666
๐ฆ๐
131
|
artificialguybr
๐ฅ
420
|
artificialguybr
โก
338
|
litagin
๐ ๐๏ธ๐ฅฐ
25
|
JacobLinCool
๐ค
48
|
gokaygokay
๐ป
116
|
BadToBest
๐จ
89
|
SkalskiP
๐ฅ
495
|
FireRedTeam
๐ฌ
43
Upscale an image by 4x using FLUX
|
Nymbo
๐
67
|
Pyramid-Flow
โฑ๏ธ
647
Better AI powered platform to purify your speech signal
|
alibabasglab
๐
215
Generate images fast with SD3.5 turbo
|
stabilityai
๐
380
3D Generation from text prompts
|
gokaygokay
๐ข
68
Schneller, besser, FLUX NF4
|
Sebastiankay
๐ผ
9
|
Menyu
๐ผ
52
|
patriotyk
๐ป
24
Add a logo to anything
|
multimodalart
๐ค
371
|
fffiloni
๐ผ๏ธ
162
Extract garment images from everyday images!
|
rizavelioglu
๐ฅ
49
|
John6666
๐ฅ๏ธ
7
Optical illusions and style transfer with FLUX
|
multimodalart
๐
842
A demo of Indic Parler-TTS
|
ai4bharat
๐
173
Restore blurred or small images with prompt
|
MartsoBodziu1994
๐ท
2
|
fantaxy
๐
52
|
Profakerr
๐ฅ
17
Generate image Furry style
|
Akimitsujiro
๐ป
18
Easily expand image boundaries
|
Perry1323
๐
12
|
deepseek-ai
๐
467
3D generation from sketchs with TRELLIS & sdxl
|
cavargas10
๐๏ธ๐ข
3
HF Space for Mistral-Small-24B-Instruct-2501 running on Zero
|
mmcgovern574
๐จ
5
Scalable and Versatile 3D Generation from images
|
cortwave
๐ข
2
Compare any two VLMs, side-by-side.
|
sflindrs
๐ข
2
|
omlab
๐ฌ
64
|
smirki
๐
3
Liquid demo app
|
Junfeng5
๐ฅ
42
App to transcribe your Speech into Text using HF models
|
mozilla-ai
๐
8
Compare latest VAE's
|
rizavelioglu
๐
40
|
hayas
โก
2
Image to Compositional 3D Scene Generation
|
VAST-AI
๐
183
|
MaverickAlex
๐
2
Generate characters complete with world, backstory, and imgs
|
Jensin
โจ๐๐
6
Stereo image generation
|
FQiao
๐
21
|
hayas
โก
2
|
ds4sd
๐ฆ๐
230
Model for predicting micro-millisecond motions in proteins
|
gelnesr
๐งฌ
7
|
fffiloni
๐
132
|
jzq11111
๐
20
|
AC2513
๐ฌ
2
Generate 3D texture from image
|
VAST-AI
๐ฎ
84
|
Yanrui95
๐
15
|
pngwn
๐๏ธ
19
Try Orpheus TTS here
|
baconnier
๐
2
|
khang119966
๐ฅถโ๏ธ๐ฅถ
3
demonstrating how the model retrieve words by sub-word token
|
Guy24
๐ป
2
|
atlasia
๐ฆ
2
|
Nymbo
๐ฏ
2
|
vdmbrsv
๐
2
RAG AI with GDPR & EDPB PDFs
|
arsiba
๐
2
Robust ID Association for Group Photo Personalization.
|
DamonDemon
๐ธ
2
Edit an image based on the given instruction.
|
innoai
๐ป
2
cellpose is a generalist algorithm for cellular segmentation
|
mouseland
๐ฌ
2
|
cella110n
๐ผ๏ธ
2
|
hysts
๐ฉ
12
emotion recognition
|
hysts
๐ฅ
18
|
hysts
๐ป
43
head pose estimation
|
hysts
๐
8
|
hysts
๐
12
|
AttendAndExcite
๐ป
95
|
hysts
๐
12
|
aipicasso
๐
103
image captioning, VQA
|
hysts
๐
147
|
rizavelioglu
๐
8
|
declare-lab
๐
90
|
joaogante
โก
17
Semantic search through 110M academic publications
|
colonelwatch
๐
14
text-to-video
|
hysts
๐
154
htrflow demo app
|
Riksarkivet
๐ข
40
|
huggingface-projects
๐
472
|
huggingface-projects
๐ฆ
480
text-to-image
|
hysts
๐
377
|
sky24h
๐ค
33
|
MohamedRashad
๐
24
|
๐จ
264
|
sanchit-gandhi
๐ฅ
265
|
r3gm
๐
396
|
hysts
๐
13
|
TheStinger
๐ป
779
LLM, chatbot
|
hysts
๐จ
66
|
hysts
โก
15
|
hysts
โก
18
VQA
|
hysts
โก
31
LLM, chatbot
|
hysts
๐
27
|
hayas
โก
12
|
Zitang
๐ฆ
11
|
ccareaga
๐ป
15
|
fffiloni
๐
53
|
ECLIPSE-Community
๐
14
Draw/upload image and search among WikiART using SigLIP
|
merve
๐
72
A demo of OpenDalle V1.1 on a ZERO GPU.
|
mrfakename
๐ผ๏ธ
410
Magnify subject details and enhance image quality
|
fffiloni
โจ
240
|
r-neuschulz
๐
101
Comparing powerful multilingual zero-shot image clf models
|
merve
๐ข
13
|
naver
๐ฌ
10
|
briaai
โก
57
|
LiheYoung
๐
525
|
tomg-group-umd
๐
67
|
sarulab-speech
๐
11
Analyze context usage in LM generations with model internals
|
gsarti
๐ ๐
15
|
ybelkada
๐
11
|
Locutusque
๐
20
|
ddosxd
๐
34
State-of-the-art open-vocabulary image segmentation โก๏ธ
|
merve
๐ป
96
|
tsujuifu
๐ฉโ๐จ
325
|
briaai
๐ข
44
a model that explains math very well !
|
Tonic
๐๐ค๐จ๐ปโ๐ฌ
35
Realtime Image/Video Gen AI Arena
|
TIGER-Lab
๐
281
|
Ceneksanzak
๐
3
|
briaai
๐ฆ
18
PPSurf converting point clouds to meshes
|
perler
๐ฟ
2
|
MohamedRashad
โ
25
|
briaai
๐
36
|
dylanebert
๐ฆ
117
Powerful foundation model for zero-shot object tracking
|
merve
โก
64
text streaming space using Gemma-7B
|
not-lain
๐w๐
15
|
ByteDance
โก
298
State-of-the-art Object Detection YOLOV9 Demo
|
kadirnar
๐
71
Generate and apply matching music background to video shot
|
fffiloni
๐๏ธ๐บ
75
Generate highly aesthetic images
|
playgroundai
๐
1109
|
sail
๐ฑ
26
|
UNESCO
๐
42
Turns your image into matching sound effects
|
Bils
๐ถ
16
|
hansyan
๐
59
Robust, duration-controllable voice-cloning TTS
|
mrfakename
๐ซ
5
A chat demonstration of BioMistral-7B! ONLY FOR RESEARCH!
|
Artples
๐
7
|
amphion
๐
178
|
KBlueLeaf
๐
90
Generate unique drums track for any MIDI
|
asigalov61
๐ผ๐ถ
9
|
CharlieAmalet
๐
1
|
lemonaddie
๐จ
122
|
CharlieAmalet
๐
3
Text-to-Image
|
artificialguybr
๐ข
108
The Vokan TTS demo!
|
ShoukanLabs
๐
25
View how beam search decoding works, in detail!
|
m-ric
โ๐
138
Image to Video Synthesis
|
TIGER-Lab
๐ฅ
35
|
merve
๐ฅ
148
|
artificialguybr
๐ป
25
|
artificialguybr
โก
30
|
mlabonne
๐ฎ
7
User Friendly Image & Video Upscaler!
|
Nick088
๐ฅ๐น
77
Demo for BRIA 2.3 FAST text-to-image
|
briaai
๐
15
Long-form Musicgen
|
ylacombe
๐ท
22
|
dylanebert
๐ง
59
A retrieval system with chatbot integration
|
not-lain
๐w๐
55
|
Naozumi0512
๐ฌ
6
|
ZJYang
๐
203
Video Editing
|
TIGER-Lab
๐ฅ
70
A Diffusion-free One-Step Visual Perception Generalist Model
|
guangkaixu
๐
13
|
artificialguybr
๐
64
|
ChenoAi
๐ฅ
85
|
devilent2
๐
48
|
WalidBouss
๐
3
|
TencentARC
๐
48
|
MykolaL
๐
105
|
szymanowiczs
๐
55
|
tonyassi
๐ธ๐ป
22
|
multimodalart
๐ป
283
|
IDKiro
โก
15
Multimodal Language Model
|
TIGER-Lab
๐
25
|
ethanweber
๐
75
|
clinteroni
๐ข
69
|
artificialguybr
๐
73
|
kotoba-speech
๐ฅ
19
|
merve
๐ฆ๐ฆ
47
|
merve
๐ฆ๐ฆ
17
|
multimodalart
๐
123
|
nroggendorff
๐ฉ
126
|
PKUWilliamYang
โก
6
|
briaai
๐
59
|
dilightnet
๐ก
6
fill the visual gap
|
prithivMLmods
๐
156
|
ByteDance
๐ฅ
192
|
ByteDance
๐
237
Bambara Translation and Text to Speech with Audio Enhancemen
|
oza75
๐
9
|
Deadmon
๐ง๐ฟ๐ง๐ฝโ๐ฆฑ
9
Future-oriented Anime model
|
aipicasso
๐
34
|
alfredplpl
๐ฆ
8
|
pyvene
๐ค
3
Compare models that generate images ultra fast in 1 step
|
multimodalart
๐ฆถ
127
4k Image from text in 5 second
|
KingNish
๐ฅ
461
High-fidelity Text-To-Speech
|
sanchit-gandhi
๐
29
Juggernaut X V10, a powerful text2image model.
|
Walmart-the-bag
๐
90
|
Vision-CAIR
๐๏ธ๐ฟ
42
Voice conversion framework based on VITS
|
r3gm
โก
175
Meta Llama3 8b with Llava Multimodal capabilities
|
MaziyarPanahi
๐ฅ
88
Vocal and background audio separator
|
r3gm
๐
271
|
JackAILab
๐ฅ
56
Anime model
|
Akimitsujiro
๐จ
71
|
paulengstler
๐ชก
44
Demo for BRIA 2.3 FAST LORA text-to-image
|
briaai
๐
11
|
mii-llm
๐ป
2
|
Delik
๐
40
Compare models that generate images ultra fast in 1 step
|
ChenoAi
โก
48
High-fidelity Virtual Try-on
|
Saad0KH
๐๐๐
3
|
LittleFrog
๐ธ
19
4k Image from text in 5 second
|
scooter7
๐ฅ
1
|
burakcanbiner
โก
2
|
parler-tts
โก
92
Text-to-Image
|
artificialguybr
๐
92
Enhance photo of a document with selected approaches!
|
qubvel-hf
๐
41
Turn yourself into a weeb
|
broyang
๐
70
Fixed fork of the original audio sr!
|
Nick088
๐โซ
48
|
abreza
๐ข
28
|
pablovela5620
๐
54
|
Alpha-VLLM
๐ป
127
Medical Chatbot
|
ruslanmv
๐๐ต๏ธ
12
|
DecoderWQH666
๐ผ
44
|
Tim-gubski
๐ธ
3
Stunning images using stable diffusion.
|
r3gm
๐งฉ๐ผ๏ธ
165
Try PaliGemma on document understanding tasks
|
merve
๐
52
|
huggingchat
๐
73
|
Yardenfren
โก
17
Dressfit
|
patrickligardes
๐๐๐
6
|
Azure99
๐
8
|
ysharma
๐ป
215
Chat llama-cpp-agent that can search the web.
|
poscye
๐ฆ
40
|
huggingchat
๐
22
LLaMAX3 Translator
|
vilarin
๐
39
|
wyysf
๐
183
Feature Matching with Foundation Model Guidance
|
qubvel-hf
๐ฆ
14
|
llamafactory
๐ฌ
8
let's talk about the meaning of life
|
vikhyatk
๐
51
Generate incredible videos using Openai Sora
|
Loomisgitarrist
๐๐ฅ
29
Video Dubbing with Open Source Projects
|
sub314xxl
๐๐ท๏ธ
8
|
vilarin
๐
25
|
Ryukijano
๐ผ
33
|
linoyts
๐๏ธ๐
77
|
linoyts
โก๏ธ๐๏ธ๐
175
Translate fragment ion peaks into sequence of amino acids
|
InstaDeepAI
๐
5
|
Doubiiu
๐ป
979
|
clement-pages
๐ฐ
17
|
dwb2023
๐
5
Fastest high-quality video diffusion model.
|
TIGER-Lab
๐ผ
42
Multi-modal LLM for image personalization
|
yizhezhu
๐
14
|
awacke1
๐
3
Flux.1-lumiere
|
vilarin
๐
47
|
cbhhhcb
๐ข
2
SDXL : Fast Image Generation
|
prithivMLmods
๐น
90
|
andrewkatumba
๐ฆ๐ฆ
3
3D novel view synthesis from any number images!
|
kxic
๐ธ๐ธ๐ธโก๏ธ๐ผ๏ธ๐ผ๏ธ๐ผ๏ธ๐ผ๏ธ
46
|
EvanTHU
๐
41
|
AIGC-Audio
๐ข
21
|
gvecchio
๐งฑ
20
|
arad1367
๐จ
6
|
tori29umai
๐
8
Chat with DeepHermes
|
vilarin
๐
39
|
xichenhku
๐จ
124
|
llamafactory
๐ฌ
18
|
ameerazam08
๐ป
159
Chat llama-cpp-agent that review your code.
|
poscye
๐
23
Stunning images using stable diffusion.
|
John6666
๐งฉ๐ผ๏ธ๐ฆ
145
|
fantaxy
๐ฅ
220
Flux.1 Fill
|
vilarin
๐
51
|
Sergidev
๐
35
|
andyaii
๐ข
11
|
el-el-san
๐ผ
10
A fixed GFPGAN for face restoration
|
Nick088
๐
12
|
jhshao
๐ฅ
56
|
AIGC-Audio
๐
13
|
stabilityai
๐จ
1598
Stable Diffusion 3 Medium with SuperPrompt-v1 Enhancement!
|
Nick088
๐ท๐ผ๏ธ
33
|
Stable-X
๐ฅ
12
Flux-Labs with LoRA
|
vilarin
๐ข
63
|
alfredplpl
๐
26
|
rphrp1985
๐จ
5
Filters | Grid | Collage Style | Quality Styles
|
ijohn07
๐
11
DALLE 4K | A RealVisXL_V3, V4 | HI-Res Images Gen.
|
ijohn07
๐๏ธ
18
|
asoria
๐ป
1
|
Stable-X
๐ต๏ธ
78
|
fffiloni
โก
70
High quality image generation in 3 second
|
ijohn07
โก
6
|
sonalkum
๐
16
|
yuntian-deng
๐
29
Restylize & repose person ID
|
okaris
๐ง๐ปโโ๏ธ
452
|
qubvel-hf
๐
11
|
paint-by-inpaint
๐๏ธ๐งน
23
|
gokaygokay
๐
754
|
SixOpen
๐ฅ
42
Demo of "MotionFix:Text-driven 3D Human Motion Editing""
|
atnikos
๐ป
2
4M: Massively Multimodal Masked Modeling
|
EPFL-VILAB
โก
201
|
ahmed-masry
๐จ
106
|
davanstrien
๐
86
|
andito
๐
61
High-fidelity Virtual Try-on
|
paroksh-mason
๐๐๐
15
Play with & compare Stable Diffusion Models
|
Nick088
๐๐ผ๏ธ
20
|
Plachta
๐๏ธ๐พ๐๐ฃ๏ธ
17
|
kevinwang676
๐จ
1
|
nihalnayak
๐ฌ
10
Document Retrieval
|
manu
๐
117
Demo for MiniCPM-o 2.6 to answer questions about images
|
sitammeur
๐ข
49
|
๐๏ธ
48
Convert audio to subtitles
|
reedmayhew
๐ป
8
|
ymzhang319
๐
119
|
pollyai
๐
2
|
gokaygokay
๐ป
536
Chatbot
|
huggingface-projects
๐ป
101
|
bghira
๐๏ธ
8
|
Felix92
๐ฅ
15
|
bdsqlsz
โก
19
|
ruslanmv
๐ฌ
6
|
alfredplpl
๐ชฝ
5
|
John6666
๐๐ฆ
73
Video captioning/tracking
|
merve
๐
98
MP-SENet is a speech enhancement model.
|
JacobLinCool
๐
12
|
Freak-ppa
๐
3
Chat with aya-expanse-8b
|
vilarin
๐
66
|
SkalskiP
๐ฅ
196
|
wondervictor
๐
39
|
gokaygokay
๐ป
143
Run any Stable Diffusion model with LoRAs
|
habulaj
๐งโโ๏ธ
21
|
votepurchase
๐ผ
3
|
sonalkum
๐
5
|
votepurchase
๐ผ
16
|
aixsatoshi
๐
6
|
bghira
๐ผ๏ธ๐๏ธ
4
|
votepurchase
๐ผ
7
|
sarulab-speech
๐
10
|
Gyufyjk
๐
14
Text-to-Image
|
John6666
๐ผ๐ผ๏ธ๐ฆ
100
a tiny vision language model
|
ThomasSimonini
๐
4
|
gokaygokay
๐
159
|
rphrp1985
๐
2
Stunning images using stable diffusion.
|
eienmojiki
๐
25
Apply the motion of a video on a portrait
|
innoai
๐คช
34
|
gokaygokay
โก
171
|
MohamedRashad
๐จ
156
|
yuntian-deng
๐
9
|
dylanebert
๐ฆ
8
Image Generation
|
prithivMLmods
๐ฅ
185
|
SakanaAI
๐
43
|
bdsqlsz
๐ป
17
|
KBlueLeaf
๐
100
|
chrisgreenx
๐ฌ
2
|
gokaygokay
๐ผ
89
|
JournalistsonHF
๐จ
14
|
nroggendorff
โจ
6
|
bokesyo
๐ฅ
12
|
OzzyGT
๐
45
|
ethanchern
๐
21
|
amphion
๐
26
|
merve
๐
31
|
zheyangqin
๐
133
Chat with Mistral
|
vilarin
๐
116
|
innoai
๐
19
Style-Preserving Text-to-Image Generation
|
Hatman
๐ฉโ๐จ
5
|
multimodalart
๐ผ
149
|
naver
๐
48
|
merve
๐ฅ
79
|
gokaygokay
๐ข
169
|
sky24h
โก
2
|
bghira
๐ซ
5
|
rinna
๐ผ
9
|
cdnuts
๐
9
|
briaai
๐
14
|
gokaygokay
โก
65
|
jiuface
๐
6
High-fidelity Virtual Try-on
|
jjlealse
๐๐๐
3
|
feishen29
๐ผ
69
|
Freak-ppa
๐
16
|
1aurent
โ๏ธ
14
|
Remsky
๐ธ๏ธ
19
Multimodal Image-to-Video
|
maxin-cn
๐ฅ
204
|
JacobLinCool
๐ง
37
Intelligently compare any pair of MIDIs
|
asigalov61
๐
5
Chatbot
|
huggingface-projects
๐ป
80
|
fffiloni
๐
55
Aesthetically Controllable Text-Driven Stylization w/o Train
|
fffiloni
๐จ
187
|
mann-e
๐ฅ
6
Create a 3D model from an image in ~10 seconds!
|
jkorstad
๐
54
|
sky24h
๐
15
|
bunarivenna
๐ฅ
8
|
mimbres
๐ธ
39
|
dan-durbin
๐
6
|
theaiinstitute
โก
9
|
Delik
๐
22
|
relik-ie
๐
48
|
tori29umai
๐ป
15
|
Greff3
๐ฅ
10
|
Nick088
โก
63
|
remyxai
๐
7
|
sahirp
๐
33
The most opinionated, anime-themed SDXL model
|
doevent
๐
5
|
zhengchong
๐
188
|
hamacojr
๐
7
|
๐ข
237
|
unity
๐ป
94
|
ZENLLC
๐ป
11
|
wondervictor
๐
62
|
1aurent
๐
14
|
ragavsachdeva
๐ข
13
FLUX Dev - Controlnet Canny
|
DamarJati
๐ง
228
|
Yehor
๐๏ธ
1
|
khang119966
๐ฅถโ๏ธ๐ฅถ
11
|
ucsahin
๐ค๐น๐ท
11
|
doevent
๐
32
Clarity AI Upscaler Reproduction
|
Svngoku
๐ผ๏ธ๐ช
10
|
Ravenok
โก
1
MidJour | A RealVisXL_Turbo | IRL HI-Res Images Gen
|
Dagfinn1962
๐๏ธ
16
|
nyanko7
๐ฆ
65
|
mPLUG
๐ข
21
|
bokesyo
๐ฅ
31
|
rayli
๐
36
|
yslan
๐
8
Demo for MiniCPM-o 2.6 to answer questions about videos
|
sitammeur
๐ฆ
7
|
maxiw
๐ป
16
|
RaushanTurganbay
๐ป
4
|
fantaxy
๐
18
|
brandonsmart
โฐ๏ธ
61
|
ZENLLC
๐ฃ๏ธ
3
|
doevent
๐ข
14
|
SahaniJi
โก
7
|
sindhuhegde
๐
4
FLUX 4-bit Quantization(just 8GB VRAM)
|
ginipick
๐ฆ๐๐ฆ
370
|
jan-hq
๐ป
114
|
mlabonne
๐ฅ
2
|
modelscope
โก
13
|
NVEagle
๐
61
|
jadechoghari
๐ฆ
11
|
John6666
๐๐ป
131
|
Sony
๐
68
Human parsing model by Meta Reality Labs
|
fashn-ai
๐
62
|
Shivam098
๐
7
Clarity AI Upscaler Reproduction
|
ZENLLC
๐ผ๏ธ๐ช
2
|
Awell00
โก
10
|
artificialguybr
๐จ
10
|
Shakker-Labs
๐ผ
122
Create high-quality HD cutouts with just a text prompt
|
finegrain
โ๏ธ
481
Image Generation using OpenDalle V1.1
|
SunderAli17
๐ป
8
|
fantos
๐ผ
14
|
pundhirdevvrat
๐ป
4
|
fffiloni
๐จ
62
|
xianbao
๐
6
|
MaziyarPanahi
๐ฅ
82
|
wjbmattingly
๐
28
Clarity AI Upscaler Reproduction
|
aliceblue11
๐ผ๏ธ๐ช
1
|
๐
118
|
RED-AIGC
๐ผ
9
|
GanymedeNil
๐ฅ
253
|
dn6
๐ฝ
197
|
showlab
๐
16
|
๐ฆ
81
|
๐ป
89
|
๐
54
Voice Clone Multilingual TTS
|
fantos
๐ฅ
80
|
aiqtech
๐ฝ
12
Multimodal Image-to-Video
|
aiqtech
๐ฅ
11
|
alvarobartt
๐ผ
20
|
AUEB-NLP
๐
4
|
AdrienB134
๐ฌ
24
|
RobinsAIWorld
โฐ๏ธ
3
|
aiqtech
๐ฅ
18
|
Doubiiu
๐จ
113
|
BooBooWu
๐ฆ
5
|
aiqtech
๐ฅธ
27
Clarity AI Upscaler Reproduction
|
victorestrada
๐ผ๏ธ๐ช
2
|
maxiw
๐
94
|
huggingface-meta
๐ข
1
Content-Style Composition (GoGoGo)
|
xingpng
๐๏ธ
129
|
randomtable
๐ผ
5
Flux fashion model
|
fantos
๐
19
|
fantos
โ๏ธ
39
|
OpenGVLab
๐
17
Realvisxl V5
|
ameerazam08
โก
24
|
seawolf2357
๐
14
|
fantaxy
๐
31
|
Shakker-Labs
๐ผ
66
Realvisxl V5
|
seawolf2357
โก
127
|
talalif
๐
6
|
rrg92
โก
19
FLUX.1-Dev Text to Image with LoRA
|
ovi054
๐ป
56
|
Qdssa
๐
8
|
gokaygokay
๐
74
DALLE 4K | A RealVisXL_V3, V4 | HI-Res Images Gen.
|
dryade36513
๐๏ธ
5
Chat with Art 3B
|
freeCS-dot-org
๐
8
|
jiuface
๐ผ
6
|
ThomasSimonini
๐
4
|
kaerez
๐
2
|
Potre1qw
๐
2
|
addsw11
๐ฅ
1
|
addsw11
๐
3
|
jiuface
๐ผ
9
|
maxiw
๐
43
|
OpenSound
๐จ
7
|
John6666
๐ฌ
52
|
rolpotamias
๐
12
|
OzzyGT
๐
312
GOT - OCR (from : UCAS, Beijing)
|
Tonic
๐ฒ๐ซด๐ป๐
176
|
SunderAli17
๐
8
Chat with Pixtral 12B using Mistral Inference
|
ethux
๐
39
|
LittleFrog
๐
34
High quality Images in Realtime
|
ginipick
๐ฌโก
98
|
jotase
๐ท
5
|
mukaist
๐ข
3
Flux Animations(GIF) Generaion
|
fantaxy
๐
47
Create a 3D model from an image in ~10 seconds!
|
mukaist
๐
82
|
briaai
๐
13
High-fidelity Text-To-Speech
|
PHBJT
๐ซ๐ท๐ฅ
15
|
OpenSound
๐ฃ
263
|
multimodalart
๐ผ
150
|
OzzyGT
๐ข
28
StyleTTS2 trained on ukrainian dataset
|
patriotyk
๐
79
Ultra-high resolution image synthesis
|
roubaofeipi
๐ป
233
|
Abhaykoul
๐
17
|
sergiopaniego
๐
28
|
KingNish
๐ฅ
18
|
ai-forever
๐
18
|
panelforge
๐ผ
8
|
GonzaloMG
โก
99
|
sofianhw
๐ค
14
|
GonzaloMG
โก
23
|
GonzaloMG
โก
12
|
jasperai
๐จ
15
|
OzzyGT
๐
298
|
aiqtech
๐
20
|
OpenSound
๐ฃ
49
|
fancyfeast
โก
261
|
krchickering
๐ธ
1
|
RED-AIGC
๐
18
|
nroggendorff
โก
25
|
haodongli
๐
73
Chatbot
|
huggingface-projects
๐ป
109
|
davanstrien
๐โก๏ธ๐
75
|
doevent
๐
5
|
alfredplpl
๐
6
|
okaris
๐ง๐ปโโ๏ธ
32
|
akhaliq
๐
111
|
aiola
๐ป
24
|
ameerazam08
๐
26
|
Abhaykoul
๐ป
15
Ultra-high resolution image synthesis
|
openfree
๐ป๐ป
73
|
arad1367
๐ฝ๏ธ๐ฅ๐น
2
|
jadechoghari
๐ข
61
|
rphrp1985
๐
2
|
Underground-Digital
โก
1
|
atalaydenknalbant
๐
68
Realtime implementation of Whisper large turbo
|
KingNish
๐คฏ
310
|
zamal
๐ฑ
18
Controllable Autoregressive Image Generation
|
wondervictor
๐ป
10
A gradio demo for Posterior-Mean Rectified Flow (PMRF)
|
ohayonguy
๐ผ๏ธ
304
This is an official demo website for open-o1.
|
happzy2633
๐ฌ
112
|
akhaliq
๐
216
AI filter for your portraits
|
lichih
๐จโ๐ค
3
multilingual instruct model verifiably trained on open data
|
Tonic
๐ฆ
7
interact with videos !
|
Tonic
๐๐น
65
Image to โ 2.5D Parallax Effect Video
|
BrokenSource
๐
4
|
A19grey
๐
43
|
haodongli
๐
97
Create a 3D model from an image in 10 seconds!
|
themanfrom
๐
21
|
akhil2808
๐ฌ
4
Apply the motion of a video on a portrait
|
yerang
๐คช
5
|
multimodalart
๐งช
479
|
Dragunflie-420
๐ถ
2
|
Dragunflie-420
๐
10
|
RobinsAIWorld
๐
7
Chat with Mistral-Nemo
|
Leri777
๐
2
diffusion-based Image Restoration model
|
JOY-Huang
๐ผ
86
|
bghira
๐ป
11
Gradio demo of CogView-3-Plus
|
THUDM-HF-SPACE
๐๏ธ
53
Efficient T2V generation
|
TIGER-Lab
๐ฅ
186
This is open-o1 demo with improved system prompt
|
DeFactOfficial
๐ฌ
7
High-fidelity Text-To-Speech
|
freddyaboulton
๐
3
|
MR-Eder
๐คฏ
2
Stable audio open model from Synthio paper.
|
sonalkum
๐
14
|
litagin
๐ฅฐ๐ค๐
17
|
kevinppaulo
๐ค
1
|
rafaaa2105
๐๏ธ
38
|
MeissonFlow
๐
50
Chat with Pixtral 12B Multimodal using Mistral Inference
|
ruslanmv
๐จ
12
Compare fast FLUX
|
multimodalart
๐ป
36
Apply the motion of a video on a portrait
|
yerang
๐คช
1
Ovis1.6-Llama3.2-3B
|
AIDC-AI
๐ฆ
15
Robotics Language-Gesture Video Generation
|
HikariDawn
๐
12
Live Interactive demo for EMOVA with Qwen-2.5 backbone
|
Emova-ollm
๐ฅ
7
use the ESM3 model to predict protein structures
|
MISATO-dataset
๐งฌ๐ชฌ
7
LLM for long context
|
THUDM-HF-SPACE
๐ฌ
4
MaskGCT TTS Demo
|
amphion
๐ป
255
|
ChemFM
๐
1
|
qiuzhi2046
๐ค
9
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
|
abidlabs
๐ฃ๏ธ
19
|
jadechoghari
๐
35
|
kotoba-speech
๐
4
Depth Any Video with Scalable Synthetic Data
|
hhyangcs
โก
33
|
zamal
๐ฌ โก๏ธ ๐ฌ
5
A unified multimodal understanding and generation model.
|
deepseek-ai
๐
153
100+ Impressive LoRA's For Flux.1
|
prithivMLmods
๐ฅ
490
|
qyoo
๐ข
3
VideoLLaMA2-AV
|
lixin4ever
๐
15
Generate images with SD3.5
|
Svngoku
๐
2
|
arad1367
๐ฅ๐ฅ๐ฅ
6
Generate images fast with SD3.5 turbo
|
doevent
๐
12
|
jiuface
๐ผ
3
|
jiuface
๐ผ
2
|
๐ต
88
|
EvanTHU
๐
24
Demo EraX-NSFW-V1.0
|
erax
๐ป
6
|
JacobLinCool
๐ฏ
1
Fotorestauration, DE Beschreibung, Lokal Tutorial folgt.
|
Sebastiankay
๐ผ๏ธ
8
Pixel restauration und upskaling
|
Sebastiankay
๐
5
whisper-multi-model
|
TaiYouWeb
๐
2
MoGe live demo
|
Ruicheng
๐
55
Animagine XL 3.1 generates high-quality anime images
|
TheAwakenOne
๐
6
|
surokpro2
๐
8
|
el-el-san
๐ผ
3
|
el-el-san
๐ผ
57
A Fully Open Multilingual Multimodal LLM for 39 Languages
|
neulab
๐
19
Generate images with SD3.5
|
stabilityai
๐
134
|
sergiopaniego
๐ฅ
10
Talk to Qwen2Audio with Gradio and WebRTC โก๏ธ
|
freddyaboulton
๐ฆ
11
|
omni-research
๐ฌ
27
High-fidelity Text-To-Speech
|
PHBJT
๐ฃ๏ธ
32
|
rombodawg
๐
8
|
jadechoghari
๐ป
182
MaskGCT TTS Demo
|
Svngoku
๐ป
15
(Tongyi Lab) ACE: All-round Creator and Editor
|
scepter-studio
๐ช
208
Convert documents to Markdown or JSON with metadata
|
yasserrmd
๐ข
8
|
Muhammadreza
๐ฅ๏ธ
9
Intelligent system for Multimodal Affective States Analysis
|
DmitryRyumin
๐๐ฒ๐๐ฅ๐ฅด๐ฑ๐ก
3
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
|
Gregniuki
๐ฃ๏ธ
10
|
Menyu
๐ผ
18
|
czd358121692
๐
3
|
tori29umai
๐
112
Blind Image Restoration with Instant Generative Reference
|
fffiloni
๐ฆ
301
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
|
redradios
๐ฃ๏ธ
2
รbertrage den Stil eines Bildes mit IP-Adapter+ & ControlNet
|
Sebastiankay
๐งธ
29
A Foundation Action Model For Generalist GUI Agents
|
maxiw
๐
21
Spanish finetune for the original F5 model.
|
redradios
๐ฃ๏ธ
22
|
THUDM-HF-SPACE
๐จ
1
|
breadlicker45
๐ผ๐ถ
10
StableNormal Turbo Beta
|
Stable-X
๐
17
|
panelforge
๐ผ
8
Clarity AI Upscaler Reproduction
|
Greff3
๐ผ๏ธ๐ช
4
Using colpali and milvus for multimodal search
|
saumitras
๐ป
5
Generate a video based on a text prompt using Mochi
|
thesab
๐ฟ
30
Huggingface space for JanusFlow-1.3B
|
deepseek-ai
๐
212
DimensionX: Create Any 3D and 4D Scenes from a Single Image
|
ShuoChen20
๐
16
|
shuttleai
๐ผ
30
8B instruct model from OpenCoder family.
|
OpenCoder-LLM
๐ฌ
27
1.5B instruct model from OpenCoder family.
|
OpenCoder-LLM
๐ฌ
7
|
OpenSound
๐
9
|
OpenSound
๐
9
Find Anything in 3D: Open-World 3D Part Segmentation Model
|
ziqima
โก
6
|
Aekanun
๐จ
3
Transforms PDFs to Markdown, JSON, and DOCX.
|
arad1367
๐ฆฅ๐ฆฅ๐ฆฅ
8
|
Menyu
๐ผ
6
Protein, molecule & more...
|
jadechoghari
๐งฌ
18
Create 3D mesh by chatting.
|
Zhengyi
๐
135
A 3.2B MMDiT Model distilled from flux-dev
|
TencentARC
๐ผ
15
|
HuggingFaceTB
๐
133
|
litagin
๐
9
|
litagin
๐ข
5
|
NJU
๐
23
Advanced Image Generator
|
mckeeboards
๐
3
Controlling Computers with Small Models
|
AskUI
๐
18
Vegeta's personality and voice cloned
|
santoshr24
โก
2
Compare different LLMs.
|
bazingapaa
๐จ
2
text to image gen | sdxl
|
knaimero
๐ฅ
3
Convert Diffuse Textures to Height and Normal maps
|
NightRaven109
๐
8
|
Gopalag
๐
7
|
Menyu
๐ผ
6
|
Hmrishav
๐
34
Generate image variations
|
black-forest-labs
๐ผ๏ธ
156
Video Super-Resolution with Text-to-Video Model
|
SherryX
๐
98
Depth Control for FLUX
|
black-forest-labs
๐ฉป
86
Canny Edges FLUX.1 control
|
black-forest-labs
๐
72
|
fancyfeast
๐
52
|
tsqn
๐
5
|
xiaozaa
๐ฅ๏ธ
79
|
yslan
๐
84
A Training-free Unified Model for Few-shot VAD
|
FantasticGNU
๐
8
|
showlab
๐ป
223
Clarity AI Upscaler Reproduction
|
Taylor658
๐ผ๏ธ๐ช
23
|
THUDM-HF-SPACE
๐ท
6
|
liruiw
๐
3
|
akhaliq
๐
4
|
yslan
๐
10
|
p1atdev
๐ฌ
1
Authoring Animation-Ready 3D Characters with One Click
|
jasongzy
๐
83
High Quality Inversion and Editing of FLUX and OpenSora.
|
wjs0725
๐ช
63
Easily expand image boundaries
|
MartsoBodziu1994
๐
6
Generate multi-view images with SDXL from texts
|
VAST-AI
๐ผ
22
|
Potre1qw
๐ผ๏ธ
1
|
Menyu
๐ผ
13
Identity-Preserving Text-to-Video Generation
|
JoPmt
๐ฅ
6
Nvidia Sana
|
gen6scp
๐ผ
35
Video Depth without Video Models
|
prs-eth
๐น๐น๐น
42
Efficient Track Anything
|
yunyangx
๐ป
26
Generate images with Switti
|
dbaranchuk
๐
38
|
๐
89
|
chancetophugging
๐
5
Generate anime-style multi-view images from texts
|
huanngzh
๐
104
Belarusian TTS
|
archivartaunik
๐
14
|
TencentARC
๐ธ
35
|
JacobLinCool
๐
1
|
rhfeiyang
๐
8
Latest future oriented generative model
|
aipicasso
๐
9
IndicParler_TTS for Urdu_Punjabi & Sindhi
|
PuristanLabs1
๐ฆ
3
Paligemma2 Detection with Supervision
|
onuralpszr
๐ป
16
|
Mar2Ding
๐ฅ
12
SText to Audio(Sound SFX) Generator
|
fantaxy
๐
145
Generate image variations
|
linoyts
โก๏ธ๐ผ๏ธโก๏ธ
33
|
m-ric
๐
2
caption images using Molmo 7B for natural language prompt
|
quarterturn
๐ข
1
training free image editing with Flux
|
rf-inversion
๐๏ธ๐
28
|
Westlake-AGI-Lab
๐
9
kandinsky_4_flash
|
ai-forever
๐ผ
25
BRIA 2.3 ControlNet Pose
|
briaai
๐
3
|
davidmeikle
๐
3
Gradio demo for FlowEdit: Inversion-Free Text-Based Editing.
|
fallenshock
๐
72
TRELLIS is a large 3D asset generation model.
|
crevelop
๐
5
Generate a video based on a text prompt using Mochi
|
ruslanmv
๐จ
16
Generate Paper Reviews
|
maxidl
๐
8
|
jozee
๐
5
Detect budgerigar gender based on cere color
|
atalaydenknalbant
๐ฆ
12
Tool to generate 3D assets for games
|
MohamedRashad
๐
28
|
sysf
๐
3
|
raymerjacque
๐
8
Easily expand image boundaries
|
raymerjacque
๐
3
FLUX, Image to Texto to Image, VLM
|
Heartsync
๐ฆ
10
Fast Inversion of Rectified Flow for Image Semantic Editing.
|
MagicBag
๐ฅ
87
Image Super-resolution via Diffusion Inversion
|
OAOA
๐
393
ColorFlow: Retrieval-Augmented Image Sequence Colorization
|
TencentARC
๐
96
X-MAS LoRA Flux Image Generator
|
fantos
๐
15
Easily expand image boundaries
|
jallenjia
๐
2
CogAgent-GUI-Demo
|
THUDM-HF-SPACE
๐
5
|
nyuuzyou
โก
2
|
LTT
๐จ
8
Fashion Studio & Virtual Try-on
|
ginipick
๐ค๐ค
29
FLUXllama Multilingual(to be add more languages)
|
ginigen
โก
50
|
jallenjia
๐๐ผ๏ธ๐
5
Scalable and Versatile 3D Generation from images
|
cronos3k
๐ข
5
|
amaai-lab
๐
13
|
rphrp1985
๐ค
3
|
dofbi
๐
1
Open LLM(CohereForAI/c4ai-command-r7b-12-2024) and RAG
|
VIDraft
๐จ
16
|
lsxi77777
๐
13
Gradio Demo of DI-PCG
|
TencentARC
๐
6
Scalable and Versatile 3D Generation from images
|
broyang
๐ข
3
|
Martim-Ramos-Neural
๐ฅ
13
High-fidelity Virtual Try-on
|
NikhilJoson
๐๐๐
6
NOVA Text-to-Image
|
BAAI
๐ผ๏ธ
6
InstantID-XS
|
RED-AIGC
๐ผ
8
|
openfree
๐ผ๏ธ
62
CAD-Recode: Reverse Engineering CAD Code from Point Clouds
|
filapro
๐
8
Detect harms and risks with Granite Guardian 3.1 8B
|
ibm-granite
๐
15
|
scdrand23
๐ฅ
7
|
NightRaven109
๐ข
11
|
mukaist
๐
8
Chat with an Italian Small Model
|
anakin87
๐๐ค๐ฎ๐น
3
Lightning fast 5-sec inpainting and outpainting, uncensored.
|
LPX55
โก
6
|
seawolf2357
๐ผ
11
|
seawolf2357
๐ผ
15
Space for Qwen2-VL-2B-Instruct
|
developer0hye
๐
2
Scalable and Versatile 3D Generation from images
|
kushbhargav
๐
7
|
panelforge
๐ผ
2
|
panelforge
๐ผ
2
Stunning images using stable diffusion.
|
bobber
๐งฉ๐ผ๏ธ
4
MidJour | A RealVisXL_Turbo | IRL HI-Res Images Gen
|
ijohn07
๐๏ธ
3
stable hamster | stable diffusion
|
ijohn07
๐น
5
Generate images with multiple models
|
RageshAntony
๐
4
Generate images with Switti
|
dbaranchuk
๐
143
https://huggingface.co/papers/2501.03006
|
wileewang
๐ป
239
Generate anime faces with conditional flow matching
|
ntt123
๐
2
|
1aurent
๐
83
Qwen2.5-VL-7B-Instruct
|
davidr99
๐
1
|
ostapagon
๐ป
8
|
JackAILab
๐ผ
7
A simple app for doing HTR with various models.
|
wjbmattingly
๐ฅ
47
|
toandev
๐ฃ๏ธ
15
|
Menyu
๐ผ
5
|
yasserrmd
๐
3
|
mukaist
๐
8
Gaze detection using Moondream
|
moondream
๐
163
Qwen 0.5B but QwQ
|
kz919
๐ฌ
11
Space for InternVL2_5-2B
|
developer0hye
๐จ
3
InternVL2_5-8B Space
|
developer0hye
๐
5
Gaze Target Estimation
|
fffiloni
๐
15
|
briaai
๐
6
|
patrickligardes
๐ฅ
3
|
bobber
๐ฌ
3
3D generation from sketchs with TRELLIS & sdxl
|
linoyts
๐๏ธ๐ข
189
Text, Callisto-OCR3
|
prithivMLmods
๐ก
37
A demo for the multilingual LVLM Centurio
|
WueNLP
๐
4
Synthpose Markerless MoCap VitPose
|
stanfordmimi
๐
2
|
hysts
โก
166
FitDiT is a high-fidelity virtual try-on model.
|
BoyuanJiang
๐ฆ
264
Audio Conditioned LipSync with Latent Diffusion Models
|
SunderAli17
๐
5
G2P
|
hexgrad
โก
32
Chat with IBM Granite 3.1 8b Instruct
|
ibm-granite
๐
12
Dense Grounded Understanding of Images and Videos
|
fffiloni
๐จ
37
Video Dubbing with Open Source Projects
|
Dragunflie-420
๐
6
|
baohuynhbk14
๐ฅถโ๏ธ๐ฅถ
7
powered by the LLaMA.cpp backend
|
stagbrook-tech
๐ฌ๐ฅ๏ธ
2
[ 200+ Impressive LoRA For Flux ]
|
DJStomp
๐ฅณ
1
|
khang119966
๐ฅถโ๏ธ๐ฅถ
3
|
khang119966
๐ฅถโ๏ธ๐ฅถ
11
Better AI powered platform to purify your speech signal
|
alibabasglab
๐
5
Creator Friendly Text-to-Video
|
aidealab
๐
13
A french-speaking LLM trained with open data
|
Tonic
๐
8
Demo Playground for Florence 2 MSFT
|
justinj92
๐
1
|
Svngoku
๐ฅ
1
A Multimodal LLM that can interpret ECG images.
|
aidhlab
๐
2
Cartoon Image Generation
|
ginigen
โกโก
37
eBOOK Cover generation
|
ginigen
๐ข๐
54
Demo for Jasco Model Music Stems Generation
|
Tonic
๐ท๐ธ๐น๐บ๐๏ธ๐๏ธ๐๏ธ๐ง
25
|
zamal
๐คฏ
9
|
fancyfeast
๐ฌ
25
VITA-1.5 demo
|
VITA-MLLM
๐
2
Line Art Colorization with Precise Reference Following
|
fffiloni
๐ฅท
50
|
HuggingFaceTB
๐
60
|
TakiTakiTa
๐
3
|
Hammedalmodel
๐
8
ImageGenerator for GWS Technologies
|
gws-technologies
๐๏ธ
2
Generate images with SD3.5
|
Jeff850
๐
3
Text-to-Video
|
RageshAntony
๐ฅ
11
|
gws-technologies
๐ฅ๏ธ
1
MidJour | A RealVisXL_Turbo | IRL HI-Res Images Gen
|
WatchOutForMike
๐๏ธ
1
Generate images with Absynth 2.0
|
benjamin-paine
๐
3
MidJour | A RealVisXL_Turbo | IRL HI-Res Images Gen
|
WatchOutForMike
๐๏ธ
2
|
rafaaa2105
๐
6
Redux
|
nftnik
๐
2
|
ainz
๐ผ
5
|
chheplo
๐ฌ
19
Source-code Include
|
openfree
๐คช
90
|
1aurent
๐ฏ๏ธ
1
|
bobber
๐ฌ
3
kvpress: LLM KV cache compression made easy
|
nvidia
๐
24
Belarusian TTS Demo + stress
|
archivartaunik
๐
6
|
shuttleai
๐ผ
76
|
veltre
๐ฆ๐ฆ
1
Fast Text 2 Video Generator
|
LAJILAODEEAIQ
โก
1
Frontier Foundation Models for Video Understanding
|
lixin4ever
๐ฌ
66
|
openfree
๐ผ
29
|
Jingkang
๐
13
Frontier Foundation Models for Video Understanding
|
lixin4ever
๐ฌ
18
|
veltre
๐ฆ๐ฆ
1
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
|
SunderAli17
๐ฅ
12
(ICLR 2025) https://github.com/qq456cvb/3DCorrEnhance
|
qq456cvb
๐ป
1
|
ginipick
๐
13
|
Aarifkhan
๐
3
A humble space for trying EGTTS V0.1
|
MohamedRashad
๐จ
23
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
|
gorbiz
๐ฅ
2
A unified multimodal understanding and generation model.
|
AP123
๐
103
A unified multimodal understanding and generation model.
|
unography
๐
1
A unified multimodal understanding and generation model.
|
mkozak
๐
6
A unified multimodal understanding and generation model.
|
afrideva
๐
20
Deepseek AI's Janus-Pro-7B: Generate image from text
|
Bils
๐
17
Deepseek AI's Janus-Pro-7B: Generate image from text
|
LLMhacker
๐
63
|
aifeifei798
๐ผ
5
OpenSource Music Generator
|
innova-ai
๐ฉโ๐ค
43
Deepseek AI's Janus-Pro-7B: Generate image from text
|
AnonTnf
๐
3
A test for darija TTS model
|
medmac01
๐
12
|
PramaLLC
๐
182
German zero Shot voice cloning with llasa 1b finetuned
|
SebastianBodza
๐ฅ
8
Space for Qwen2.5-VL-3B and 7B image + text demo.
|
mrdbourke
๐
8
FitDiT is a high-fidelity virtual try-on model.
|
felipevictal
๐ฆ
4
|
ameerazam08
๐
49
Generate images with Lumina Image 2.0
|
benjamin-paine
๐ก
117
|
wuhp
๐ป
2
|
opentyphoon
๐ฌ
3
HF Space with Zurich 14B GCv2 5m
|
rubenroy
๐โก๐
8
Generate compressed images given different input conditions
|
DDCM
๐
9
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
|
srinivasbilla
๐ฅ
35
A personalized image generator
|
nikhilsoni700
๐
1
Languages ru,en,zh-cn,ja,de,fr,it,pt,pl,tr,ko,nl,cs,ar,es,hu
|
luigi12345
๐
5
Use AI to Change Clothing
|
frogleo
๐๐๐
14
|
frogleo
๐ผ
6
|
Aatricks
๐
12
|
Aarifkhan
๐ป
6
|
qubvel-hf
โก
2
Chat with vie-publique.sn
|
dofbi
๐ฌ
1
The Ultimate Anime-themed SDXL model
|
doevent
๐
1
Text-to-3D and Image-to-3D Generation
|
Wkatir
๐
5
|
cavargas10
๐
3
[ 250+ Impressive LoRA For Flux ]
|
CultriX
๐ฅณ
1
|
RageshAntony
๐
2
Coqui-XTTS Text-to-Speech Demo with Vietnamese
|
jimmyvu
๐ข
9
|
Steveeeeeeen
๐
5
|
shb777
๐
28
Scalable and Versatile 3D Generation from images
|
davinciwearables
๐ข
1
based by Kolors Controlnet Pose Tryon
|
ginigen
๐ผ
13
Advanced Document Topic Analyzer
|
VIDraft
๐
4
Create a 1M faces 3D colored model from an image!
|
VIDraft
โก
9
Protecting Protein Generative Models with Watermark.
|
Zaixi
๐
3
|
CerebrumTech
๐ข
1
Powered by Ginigen 3D Style Image, Unique3D
|
ginigen
โก
43
|
yiren98
๐ค
11
|
Alpha-VLLM
๐ผ
137
Video Generator โก from stories
|
ruslanmv
๐
6
|
Fancy-MLLM
๐
12
|
oza75
๐ฌ
2
|
THUdyh
๐
10
|
Perry1323
๐ฅ
4
|
phitran
๐ฅ
2
Generate a video from any number of images
|
kiwhansong
โจ
21
|
sflindrs
๐ฅ
1
|
Ryukijano
๐ฆ
1
|
AlekseyCalvin
๐๐ป
2
image captioning, VQA
|
VIDraft
๐
6
Deepseek Multimodal -Automated Real Anything-to-Anything
|
VIDraft
โป๏ธ
11
3D-aware Video Diffusion for Video Generation Control
|
EXCAI
๐
25
3D Style Image Generator R1: Fast & High Quality Mode
|
ginigen
๐ผ๐
23
Gradio demo of CogView4-6B
|
THUDM-HF-SPACE
๐๏ธ
105
Flux version implementation of LayerDiffuse
|
ginigen
๐ช
9
|
HuggingFaceTB
๐
69
|
rafaaa2105
๐คซ
2
combines extracted subjects with AI-generated backgrounds
|
ginigen
๐๏ธ
7
See, read, and reasonโbetter together.
|
AIDC-AI
๐ฆซ
114
|
ginigen
๐งช
13
Video from images generated from text
|
nezihtopaloglu
๐ข
3
Feat2GS Demo
|
endless-ai
โจ
9
Text to 3D using Flux schnell and Trellis
|
yonnel
๐
1
|
TakiTakiTa
๐
1
Using dataset shb777/gemini-flash-2.0-speech for finetuning
|
HKUST-Audio
๐ฅ
7
Llasa-1B-Multilingual finetuned using simon3000/genshin-voic
|
HKUST-Audio
๐
10
|
ashen0209
๐ผ
23
|
hysts
โก
8
|
hysts
โก
1
Ovis2-16B
|
Mihaiii
๐ฆซ
3
|
Walid-Ahmed
๐
1
Long-form Speech Synthesis with Zonos
|
ginigen
๐
17
Transform Your Images into Mesmerizing Hexagon Grids
|
Surn
๐
4
่ฅฟๅๅทฅไธๅคงๅญฆASLPๅฎ้ชๅฎคOSUM้กน็ฎdemoๅฑ็คบ
|
ASLP-lab
๐ฌ
28
|
JacobLinCool
๐๏ธ
8
Long-Form Speech Synthesis with Zonos and DeepFilterNet
|
benjamin-paine
๐
21
|
ZhiyuanthePony
๐
25
FLUX Hand-written STYLE Genereator
|
ginigen
๐ผ
5
FLUX Hand-written STYLE Genereator
|
ginigen
๐ผ
11
Text-to-3D and Image-to-3D Generation
|
inoculatemedia
๐
1
Small model can do big things.
|
AIDC-AI
๐ฆซ
16
Ovis2-2B
|
AIDC-AI
๐ฆซ
5
Ovis2-4B
|
AIDC-AI
๐ฆซ
12
Ovis2-8B
|
AIDC-AI
๐ฆซ
9
|
hayas
โก
2
test
|
hujiecpp
๐
12
|
hayas
๐ฅ
6
|
cbensimon
๐
12
FLUX Hand-written STYLE Genereator
|
ginigen
๐ผ
47
|
yeq6x
๐ค
3
CrossFlow directly evolves text representations into images
|
QHL067
๐ผ
3
FlexTok flexible sequence length autoencoding demo
|
EPFL-VILAB
๐ผ
5
|
ai-forever
๐ผ
10
Chatbot
|
TheFinAI
๐
1
|
atlasia
๐ค ๐ฒ๐ฆ
11
transformation and timeless creativity in innovative
|
ginigen
๐ผ๐
7
OmniParser, turn your LLM into GUI agent
|
ginigen
๐ข
11
|
ginigen
๐งช
11
|
smirki
๐
4
|
retwpay
๐ผ
2
|
retwpay
๐ผ
1
|
Eshita-ds
๐ฌ
1
|
retwpay
๐ผ
1
Try latest FluentlyLM model!
|
ehristoforu
๐จ๏ธ
3
|
moondream
๐
6
|
ahmed-masry
๐ป
3
A demo of HVI-CIDNet
|
Fediory
๐
17
|
retwpay
๐ผ
3
|
Locutusque
๐บ
8
Huggingface demo of TrajectoryCrafter
|
Doubiiu
๐
52
Try AuraFlow-v0.3 to generate images
|
merterbak
๐ผ๏ธ
3
|
orionweller
๐
3
Image generator/customization/personalization
|
nupurkmr9
๐ผ
94
|
Gemini899
๐ผ
1
|
ameerazam08
๐
100
Stunning images using stable diffusion.
|
Menyu
๐งฉ๐ผ๏ธ๐ฆ
9
Visual q and A
|
Walid-Ahmed
๐ข
1
Magma-8B model for UI Agents
|
microsoft
๐
27
Anime Line Extractor
|
aidenpan
โก
4
Ufcas transcription
|
Ticsocial
๐ฅ
1
Long-VITA Demo
|
shenyunhang
๐
2
RAG example using Granite [vision, embedding, instruct]
|
ibm-granite
๐
15
Wan: Open and Advanced Large-Scale Video Generative Models
|
azhan77168
๐ป
6
|
tight-inversion
๐ป
73
Large Language Diffusion Models
|
multimodalart
๐
156
Prompt with Images in flux[dev]
|
Hatman
๐ผ
5
Space demoing Phi4 MultiModal
|
ariG23498
๐ฆ
55
|
xingyang1
๐ป
138
|
zhang3z
๐ฅ
5
Mixture of Diffusers and ControlNet Tile Upscaler for SDXL
|
elismasilva
๐
22
Chat with Microsoft's phi-4 or phi-4-mini models
|
merterbak
๐จ
11
(Unofficial) Gradio demo for Spark-TTS
|
thunnai
โก
21
|
abdull4h
๐
2
Demo for Attention Distillation
|
ccchenzc
๐
10
|
innoai
๐ฅ
2
@image @rAgent @web @text @tts1 @tts2
|
VIDraft
๐
20
Large Language Diffusion Models
|
ginigen
๐
2
Upsample Captions using Cosmos 1.0
|
benjamin-paine
๐
2
|
YiftachEde
๐ฎ
4
|
Menyu
๐ผ
5
|
snyderline
โฑ๏ธ
1
|
rootglitch
๐ป
1
Clarity AI Upscaler Reproduction
|
evgeniy09
๐ผ๏ธ๐ช
2
NAMAA Qari Arabic OCR Model Demo
|
oddadmix
๐ข
21
Demo for Multimodal-SAE
|
lmms-lab
๐ฌ
8
Convert handwritten notes to digital format using AI
|
ZennyKenny
โ๏ธ ๐ ๐๏ธ
6
|
PiperMy
๐ค
2
Tuning-free subject-driven generation
|
primecai
๐ฆ
190
Demo for Generative Photography
|
pandaphd
๐
2
A VLM-based message decoder that is trained via GRPO
|
Groundlight
๐
8
Unofficial sbintuitions/sarashina2.2-3b-instruct-v0.1
|
alfredplpl
๐ฌ
2
|
JinhuaL1ANG
๐
5
|
tight-inversion
๐จ
66
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
|
zouyunzouyunzouyun
๐ฅ
4
Translate Vedic Sanskrit to English
|
diabolic6045
๐ต๏ธ
3
The Ultimate Anime-themed SDXL model
|
DamarJati
๐
5
A demo of Indic Seamless Model
|
ai4bharat
๐
9
|
dwb2023
๐
2
Use multi-view diffusion to enhance low-quality 3D assets
|
yslan
๐
9
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
|
eBlessings
๐ฃ๏ธ
1
Transform Prompts to PlantUML diagrams
|
vinzur
โก
3
A mini project of sign language conversation
|
JaiSurya
๐
1
D-FINE Inference Example
|
developer0hye
๐
1
ponix can go anywhere
|
cwhuh
๐ผ
1
Visualize texts in images using BPE tokens
|
TongkunGuan
โก
2
|
jameslahm
๐
51
demo of LLMs fine-tuned for concise reasoning
|
tergel
๐๏ธ
4
|
VIDraft
๐จ
3
|
hanzla
๐
3
State-of-the-art Indic language translation by AI4Bharat
|
ai4bharat
๐
27
BRIA-3.1
|
briaai
๐ข
6
|
ovi054
๐ฅ๏ธ
1
|
yuyutsu07
๐
1
A chat interface to use Reka Flash 3 OSS apache model
|
ZoroaStrella
๐ฌ
4
Space demoing Phi4 MultiModal
|
gizemsarsinlar
๐ฆ
2
Tokenization demo for arxiv.org/abs/2503.08685
|
tennant
๐
1
auto-regressive generation demo for arxiv.org/abs/2503.08685
|
tennant
๐ป
1
Consistency generation of portrait and subject
|
r4ruixi
๐ช
1
This is the space for Senorita-2M editing model
|
PengWeixuanSZU
๐
3
Demo for Amodal3R reconstruction
|
Sm0kyWu
๐ผ
18
|
hanzla
๐
3
SD3.5 in 8-steps with TensorArt TurboX
|
skjaini
๐
1
|
mukaist
๐ฅ
1
Speech-to-Text Transcription for the moroccan darija dialect
|
BounharAbdelaziz
๐๏ธ
2
A state of the art English to Moroccan darija translation
|
atlasia
๐ฅ
3
|
hanzla
๐
7
Conversational speech generation
|
tzmartin
๐ฑ
1
Blazingly Fast and Embarrassingly Simple Song Generation
|
dskill
๐ถ
1
|
nightey3s
๐ซ
2
|
ovi054
๐ป
4
Chat with multimodal gemma-3-12b-it or gemma-3-4b-it models
|
merterbak
๐
8
The Official Demo of Soundwave
|
puccho
๐
7
Chat with text or text+image (using Gemma3)
|
Didier
๐
1
|
Svngoku
๐
1
A text-to-speech model powered by SparkAudio and Mobvoi.
|
amortalize
๐
1
|
dbaranchuk
๐ผ
33
Canary 1B Flash demo
|
nvidia
๐ค
30
|
retwpay
๐ผ
1
|
raymerjacque
โก
1
|
markury
๐ป
2
Classical Japanese Chatbot
|
SakanaAI
๐ป
15
|
wjbmattingly
๐
1
UniK3D (CVPR 2025)
|
lpiccinelli
๐ข
26
Switch lamps on in your images.
|
finegrain
๐ก
43
Audio-driven Talking Portrait
|
ChaolongYang
โก
5
Dereflection Any Image
|
sjtu-deepvision
๐
4
|
khang119966
๐ฅถโ๏ธ๐ฅถ
3