# littleyuyu/stackoverflow-question-code-dataset

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/littleyuyu-stackoverflow-question-code-dataset).**

171 stars · 27 forks · Python · NOASSERTION

## Links

- GitHub: https://github.com/LittleYUYU/StackOverflow-Question-Code-Dataset
- awesome-repositories: https://awesome-repositories.com/repository/littleyuyu-stackoverflow-question-code-dataset.md

## Description

StaQC: a systematically mined dataset containing around 148K Python and 120K SQL domain question-code pairs, as described in "StaQC: A Systematically Mined Question-Code Dataset from Stack Overflow" (WWW'18)

## Tags

### Part of an Awesome List

- [Data Mining and Datasets](https://awesome-repositories.com/f/awesome-lists/data/data-mining-and-datasets.md) — Mined Python and SQL question-code pairs from StackOverflow.
