Grants

Grants

Alternative names: NSF

This dataset includes funding grants from the National Science Foundation. The task is to predict the award amount.

Original source: data.mendeley.com

Versions

  • Grants (by Jan Motl)

    • Removed DBLP tables as they were too loosely interrelated

Dataset details

Associated task:
Regression
Domain:
Education
Data types:
Size:
890.8 MB
Count of tables:
12
Count of rows:
2,914,549
Count of columns:
46
Missing values:
No
Compound keys:
Yes
Loops:
No
Type:
Real
Instance count:
385,882
Target table:
awards
Target column:
award_amount
Target ID:
award_id
Target timestamp:
award_effective_date

How to download the dataset

The datasets are publicly available directly from MariaDB database.

  1. Open your favourite MariaDB client (MySQL Workbench works, but see FAQ)
  2. Use following credentials:
    • hostname: relational.fel.cvut.cz
    • port: 3306
    • username: guest
    • password: ctu-relational
  3. Export "Grants" database (or other version of the dataset, if available) in your favourite format (e.g. CSV or SQL dump).