qwen3-coder-30B-A3B supports FIM and should be faster than the 7B if you got the...

		Mostlygeek 55 days ago \| parent \| context \| favorite \| on: Ask HN: Who uses open LLMs and coding assistants l... qwen3-coder-30B-A3B supports FIM and should be faster than the 7B if you got the vram. I use bartowkski’s Q8 quant over dual 3090s and it gets up to 100tok/sec. The Q4 quant on a single 3090 is very fast and decently smart.