Queen's Research Data Centre

Queen's Research Data Centre

site header

Longitudinal Administrative Databank (LAD) - 2016

The Longitudinal Administrative Databank (LAD) - 2016 is now available for download and can be accessed by researchers with approved projects.

Information on LAD to share with researchers:

The LAD is a sample of individual taxfilers with a longitudinal design. Currently data are available from 1982-2016. The frame is constructed from the annual T1 Family File (Annual Estimates for Census Families and Individuals (T1 Family File)) which makes use of information from administrative files. Only individual records that have social insurance numbers can be selected for the LAD and these are sampled at a 20% rate. Also included in the LAD are a set of immigration variables, drawn from the Longitudinal Immigration Data Base (IMDB), relating to information collected at landing, as well as a set of variables describing Tax Free Saving Account usage.

The LAD survey units are individuals but limited information about the characteristics of their family during the reference year is also kept (e.g. spouse/parent, family, and children). No stratification is performed as the sampling weight is equal across all units. The sampling is done once on each record in such a way that if someone is selected in a particular reference year, they will be selected in any other later (or earlier) years in which they are present in the T1 Family File.

Researchers unfamiliar with administrative tax data are cautioned that not all LAD data are internally or externally coherent, in part, because tax data are not subject to the same edit and imputation procedures as survey data. Consequently, many researchers have found it takes some time to become familiar with the LAD and to be able to operationalize it in their research.

Note: Potential researchers would benefit from reading the LAD Data dictionary (or the “technical reference guide” as it will soon be known) as part of their understanding of the database.

Due to the size of the data, it has been compressed and provided in SAS and STATA format.

New wave data will be added to the existing folder, researchers do not need to apply for access to new waves.

For any LAD related questions please contact the Survey Focal Points Sukitha Abeysekera (sukitha.abeysekera@canada.ca) and Peter Kitchen (rdc@mcmaster.ca). All other inquiries can be sent to our generic email address STATCAN.de-mad-rdcdata.STATCAN@canada.ca