This is a performance testing framework for Spark SQL in Apache Spark 2.2+. The framework contains twelve benchmarks that can be executed in local mode. They are organized into three classes and ...
EHRSQL is a large-scale, high-quality dataset designed for text-to-SQL question answering on Electronic Health Records from MIMIC-III and eICU. The dataset includes questions collected from 222 ...