Reliability of large-eddy simulations: Benchmarking and uncertainty quantification