The only thing I learned in the last year that you can't really benchmark llms a...

		Zetobal on April 10, 2024 \| parent \| context \| favorite \| on: GPT-4 Turbo with Vision is a step backwards for co... The only thing I learned in the last year that you can't really benchmark llms at all. Above a certain level it's just edge case against edge case or script kiddies and multi billion corps optimizing their fine tune against the test.