I don't have A1111 but in ComfyUI using a shared workflow that does base and then refiner, SDXL 0.9 was using 12GB of VRAM and 22GB of ram in Ubuntu for me.
Doing images of 1024x1024~
GPU: AMD RX 6800
Also using Comfy. Have been able to get away with 6GB of VRAM doing 1024x1024 and it took a bit longer but I've done a couple of 1024x2048's and they're coming out good :3
In confy
About 16-17sec to do everything at 1024x1024 with 20steps and 5 refiner.
A1111
About the same, using batch of 4 and then using batch img2img refining.
Just more clicks without extensions etc as getting the same it/s between confy and a1111 - - medvram
and im tryin to make SDXL work on my 1660ti laptop lol , comfyui runs it like 1:30 min for each pic , A1111 can’t even load the vae , however yesterday i saw and update on hugging face page of sd that they chaned to 0.9 vae for sdxl1 , seems like there was an issue with their provided 1.0 vae
It's hard to give precise figures, because there's always tricks to getting a little more or less but from my (admittedly limited) testing SDXL is significantly more demanding, and 10+GB of VRAM is probably going to be the minimum to run it. I don't remember exactly what I was doing but I run on an RTX A4500 card, and I managed to max out the 20GB of VRAM just with one SDXL process, where I can normally run a LORA training and 512x768 size images at the same time.
I can run it on my 3080 10 gig card, but Its ridiculously slow. I HAVE to use --medvram or I get out of memory errors and NaN errors. And I mean ridiculously slow. Loading the model takes a few minutes. Generating an image requires me to minimize the browser window, or stable diffusion just stalls. Switching to the refiner isnt even an option because it takes so long to switch between models.
This is on a 5930K, 32 GB Ram, 3080 10G trying to generate 1024x1024 images.
However with comfyUI, it runs just fine, PC doesnt struggle, and it generates the images in about 40 seconds at 50 steps base, 10 refiner.