Unistats dataset

The Discover Uni website provides comparable sets of information about full- and part-time undergraduate courses. It is run by the Office for Students and is designed to meet the information needs of prospective students.

Unistats 2019/20 data was first released at 10:04 on 11 September 2019 - we apologise for the delay in publishing the updated data.

As an additional technical resource for analysts and developers, we have made the raw dataset that underlies the Discover Uni website available for download. This incorporates information from the Unistats record. The download is presented as a *.zip file containing the data in both XML format and multiple *.csv files.

Supporting files, documents and information about updates to the data are provided below.

Experimental statistics - Longitudinal Education Outcomes (LEO) data

The Longitudinal Education Outcomes (LEO) data in the Unistats dataset is being published as experimental statistics by HESA, following receipt of the data from the Office for Students. This is available in the LEO.csv file and also between the Salary and Tariff entities in the XML Unistats dataset.

Further information can be found on the OfS website, where you can also provide feedback on this data.
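As a rough illustration of checking where entities such as Salary and Tariff occur in the XML file, the sketch below streams through an XML document and counts elements by tag name. It assumes nothing about the real tag spellings or namespaces, which should be confirmed against the XSD schema file; the function names are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def local_name(tag):
    """Strip a '{namespace}' prefix, if present, from an element tag."""
    return tag.rsplit("}", 1)[-1]


def count_elements(source):
    """Stream an XML file (path or binary file object) and count elements by tag.

    iterparse avoids loading the whole document at once, which matters
    for large files.
    """
    counts = Counter()
    for _event, elem in ET.iterparse(source, events=("end",)):
        counts[local_name(elem.tag)] += 1
        # Children have already been counted by the time the parent ends,
        # so the element's contents can be released.
        elem.clear()
    return counts
```

Calling count_elements on the extracted XML file returns a Counter keyed by tag name, which gives a quick view of which entities are present before writing any entity-specific parsing.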

Terms and conditions

The Unistats dataset is free to copy, use, share, and adapt for any purpose. It is published under the Creative Commons Attribution 4.0 International (CC BY 4.0) licence. You must give appropriate credit (HESA), provide a link to the licence, and indicate if any changes have been made.

Download the Unistats dataset

When you click the button above, the download will begin. The file is delivered as a compressed archive (*.zip) containing a single XML file, a readme.txt file, and a number of *.csv files.
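For developers, a minimal sketch of inspecting the downloaded archive with the Python standard library is given below. It groups the archive members by file extension and reads one *.csv member into rows keyed by its header. The function names are hypothetical, and the UTF-8 encoding is an assumption that should be checked against the readme.txt file.

```python
import csv
import io
import zipfile
from collections import defaultdict


def group_members_by_extension(archive):
    """Return {extension: [member names]} for an open zipfile.ZipFile."""
    groups = defaultdict(list)
    for name in archive.namelist():
        if name.endswith("/"):
            continue  # skip directory entries
        ext = name.rsplit(".", 1)[-1].lower() if "." in name else ""
        groups[ext].append(name)
    return dict(groups)


def read_csv_member(archive, member):
    """Read one CSV member into a list of dicts keyed by its header row."""
    with archive.open(member) as raw:
        # Assumption: the file is UTF-8; "utf-8-sig" also tolerates a
        # leading byte-order mark.
        text = io.TextIOWrapper(raw, encoding="utf-8-sig", newline="")
        return list(csv.DictReader(text))
```

To use it, open the downloaded file with zipfile.ZipFile, pass the open archive to group_members_by_extension to see which XML, txt and csv members are present, and then pass one of the csv member names to read_csv_member.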

Supporting files and documents

Unistats record 2019/20: Coding manual for the 2019/20 data collection.

XSD schema file: Unistats output schema - Provides detail of the structure of the data file.

Overview of the dataset: Unistats dataset file structure and description.

Brief introduction to XML format: Using the Unistats output file  

UKPRN codes: UNISTATS_UKPRN_lookup_20160901.xlsx. Please read disclaimers in first tab. This file will not be updated every week. UKPRN information is provided by the UK Register of Learning Providers.

Subject codes:

Lookup table for CAH 1.3.2 (applicable to latest data).

Lookup table for CAH 1.2 (applicable to data from 18/19 to 25 February 2020).

Lookup table for JACS 3.0 (applicable to data from 12/13 to 17/18).

Lookup table for JACS 2.0 (applicable to data from 09/10 to 11/12).

KISAIM codes: List of KISCourse.KISAIM valid entries.

Data updates

Updates to the Unistats dataset will be made as required, and in parallel with the Discover Uni website, when contributing higher education providers wish to update their information, or when HESA or the Office for Students (OfS) make changes to the underlying data. These updates occur weekly on Wednesday mornings. The file name of the *.zip file includes the date and time.

Please see the OfS statement on corrections and revisions to the data presented on the Discover Uni website for a full description of how Unistats data changes throughout the year and how these changes are recorded.

A Jisc mailing list has been set up to provide announcements relating to the Unistats dataset, and it will be used to notify users of changes to the specification. If you wish to add your name to this mailing list, please visit the Jisc website, click on ‘Subscribe or Unsubscribe’ and provide your name and email address. Alternatively, you can send an email to [email protected] with the Subject left blank and the Message: SUBSCRIBE KIS-UPDATE-ALERT Firstname Lastname

If you have any questions about the dataset, please email HESA’s Official Statistics team, or call +44 (0)1242 211 494.

Using older Unistats data

If you are using an older version of the Unistats dataset (downloaded before 26 February 2020), please use the supporting files for the relevant collection year:

Unistats Collection 2019/20 for dataset downloaded 2019-09-11 to 2020-02-25 [Please note, due to updates to the Common Aggregation Hierarchy, there are two versions of the Unistats output schema - one for use prior to 26 February 2020, and one for use from 26 February 2020 onwards].

Unistats Collection 2018/19 for dataset downloaded 2018-09-01 to 2019-09-10.

Unistats Collection 2017/18 for dataset downloaded 2017-09-04 to 2018-08-29 [Please note, due to the inclusion of LEO data, there are two versions of the Unistats output schema - one for use prior to 5 July 2018, and one for use from 5 July to 29 August 2018]

KIS Collection 2016/17 for dataset downloaded 2016-09-01 to 2017-09-03 [Please note, due to changes in the TEF, there are two versions of the Unistats output schema - one for use prior to 22 June 2017, and one for use from 22 June to 3 September 2017].

KIS Collection 2015/16 for dataset downloaded 2015-09-03 to 2016-08-31

KIS Collection 2014/15 for dataset downloaded 2014-08-28 to 2015-09-02

KIS Collection 2013/14 for dataset downloaded 2013-09-19 to 2014-08-27

KIS Collection 2012/13 for dataset downloaded before 2013-09-19
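The date ranges above can be turned into a simple lookup. The following sketch maps a download date to the matching collection and, for the two older collections with a mid-year schema change, indicates which schema version applies. The dates are copied from the list above; the function name, table names and return labels are hypothetical. Dates that fall outside every listed range (including dates from 26 February 2020 onwards, which use the current supporting files) return None.

```python
from datetime import date

# (collection label, first download date, last download date).
# None as the first date means "any date before the last date".
COLLECTIONS = [
    ("KIS Collection 2012/13", None, date(2013, 9, 18)),
    ("KIS Collection 2013/14", date(2013, 9, 19), date(2014, 8, 27)),
    ("KIS Collection 2014/15", date(2014, 8, 28), date(2015, 9, 2)),
    ("KIS Collection 2015/16", date(2015, 9, 3), date(2016, 8, 31)),
    ("KIS Collection 2016/17", date(2016, 9, 1), date(2017, 9, 3)),
    ("Unistats Collection 2017/18", date(2017, 9, 4), date(2018, 8, 29)),
    ("Unistats Collection 2018/19", date(2018, 9, 1), date(2019, 9, 10)),
    ("Unistats Collection 2019/20", date(2019, 9, 11), date(2020, 2, 25)),
]

# First date on which the later schema version applies within a collection.
SCHEMA_CHANGEOVERS = {
    "KIS Collection 2016/17": date(2017, 6, 22),       # changes in the TEF
    "Unistats Collection 2017/18": date(2018, 7, 5),   # inclusion of LEO data
}


def supporting_files_for(download_date):
    """Return (collection label, schema note), or None if no range matches."""
    for label, first, last in COLLECTIONS:
        if (first is None or first <= download_date) and download_date <= last:
            changeover = SCHEMA_CHANGEOVERS.get(label)
            if changeover is None:
                note = "single schema"
            elif download_date < changeover:
                note = "schema before changeover"
            else:
                note = "schema from changeover"
            return label, note
    return None
```

For example, supporting_files_for(date(2018, 7, 4)) selects the Unistats Collection 2017/18 files with the schema in use before the LEO data was added.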