Census Microdata: findings and futures

conference  

Humanities Bridgeford Street, University of Manchester

  conferencedelegates
 
 
 
1 - 3 September 2008
 

Abstracts


Day one: Monday 1 September 2008
 

The future of microdata

Professor Denise Lievesley, Special Advisor, United Nations Economic Commission for Africa

Denise will talk about the importance of preserving and providing access to well documented microdata from population censuses, other censuses and key surveys in order to support evidence-based decision and policy making within countries. She will look at this issue from a global perspective, focussing particularly on Africa. She will discuss some of the barriers which have prevented the development of access to microdata and will highlight recent developments (in relation to the establishment of an Association of African data archivists, the re-writing of national statistical legislation and the development of national statistical strategies) designed to improve the current situation whereby census data are still under-utilised within so many countries. Denise will also discuss the empowerment of the user community which can result from access to census data.

 

International Microdata Revolution

Managing Microdata Access and Statistical Confidentiality: A review of the IPUMS-International project
Dennis Trewin, Statistical Consultant

The 2003 Conference of European Statisticians was the first occasion the heads of National Statistical Offices collectively considered the question of the provision of microdata to support research. Importantly they agreed that supporting research is an important activity for National Statistical Offices (NSOs) and that most NSOs could do more to satisfy these needs. Doing more included providing improved access to microdata. They commissioned work which led to the publication of Guidelines on Confidentiality and Microdata Access (published on the web site of the Conference of European Statisticians). The Guidelines were later adopted by the UN Statistical Commission for international use. I was the leader of the Task Force which undertook the work and author of the Guidelines.

In this light I was contracted by the managers of IPUMS-International to review its conformance with international best practice. IPUMS-International is one of the world’s largest repository of microdata. The talk will highlight the main findings of the review. It will also attempt to draw out the main learnings for other repositories of microdata.

 

Intergenerational Coresidence and Economic Development: New Evidence from the International Integrated Public Use Microdata Series
Steven Ruggles and Misty Heggeness, Minnesota Population Center

We use newly-available census microdata from IPUMS-International to assess trends in intergenerational coresidence in 15 developing countries. Contrary to expectations, we find no general decline in intergenerational coresidence over the past several decades. There have been, however, significant changes in the configuration of intergenerational coresidence. Families in which the older generation is household head—a configuration consistent with traditional patriarchal forms in which the older generation retains authority—are becoming more common in most of the countries. Intergenerational families headed by the younger generation—the configuration one would expect if intergenerational coresidence were motivated by a need for old-age support—are on the decline in most of the countries. Multivariate analysis reveals that intergenerational families headed by the older generation are positively associated with measures of economic development. These findings are at variance with widely-accepted social theory. We hypothesize that housing shortage, economic stress in the younger generation, and old-age pensions may contribute to the change. More broadly, in some developing countries rising incomes may have allowed more people to achieve their preferred family structure of intergenerational coresidence following traditional family forms.

Lessons learned from cross-national research on marital homogamy
Albert Esteve Palós and Luís Ángel López, Center for Demographic Studies, University Autonomous of Barcelona

Full abstract (Word)

This paper discusses the advantages and drawbacks of carrying out cross-national research on marital homogamy using census microdata. The integration of census microdata at the international level by the IPUMS projects (Minnesota Population Center) has opened the door for cross-national research on a broad range of topics, including assortative mating. Census microdata are an invaluable resource for social science research. Other sources—such as demographic and labor force surveys—often offer greater subject coverage and detail than do census data, but no alternate source offers comparable sample density, chronological depth, and geographic coverage.

The Census-project: Evaluating the IECM base and enhancing users' access
Sabine Springer, CENSUS-Project, French National Institute for Demographic Studies

Full abstract (Word)

The overall objective of the CENSUS project consists in the evaluation of the coverage and quality of the Integrated European Census Microdata base through demographic methods. This database, which is still under construction, has been built in cooperation of the Minnesota Population Center and the Centre d’Estudis Demogràfics of Barcelona. The countries already available and which will be considered in the CENSUS project are Belarus, France, Greece, Romania, Spain, Hungary and Portugal, a total of 7 countries and 22 censuses. The final outcome will be guidelines for good practices for potential users of the database prepared by the three participating research groups: Research Unit for International Comparability (INED), the European Demographic Observatory and the Institute for demographic studies of the University of Bordeaux. The project is financed by the National Research Agency of France (ANR).

 
Beyond cross sections: Insights from analysis of longitudinal microdata
 

Health inequalities using Belgian census and linked mortality data
Patrick Deboosere, Department of Social Research, Vrije Universiteit Brussel

Administrative data is not designed for research but rather a by-product of “administrative bookkeeping”. Computerization has played a major role in the explosion of the research potential of administrative data for demography. The centralization and computerization of the Belgian municipal registers in the National Population Register became fully operative in 1988 and boosted a new research field on the analysis of administrative data. The linkage of core socio-demographic data from the census with event data of the national population register has been applied extensively in several Scandinavian countries proving how this kind of research design contributes to evidence based scientific progress in demography, epidemiology and sociology. As the cost of linking data is relatively small and resulting databases are highly informative, the re-use of administrative data for statistical purposes for research and policy has become increasingly popular. Monitoring evolutions is becoming crucial in our complex modern societies in order to develop efficient policies. Population health is no exception in this regard. The inclusion of a small set of health questions in the Belgian census and the linkage with mortality data demonstrates the potential of administrative data combined with simple single-item questions for the monitoring health inequalities.

Comparing results from the England and Wales, Scotland and Northern Ireland Longitudinal Studies
Harriet Young and Emily Grundy, London School of Hygiene and Tropical Medicine, Paul Boyle, St Andrews University and Dermot O’Reilly, Queens University Belfast

There are now three separate record linkage studies which include census and vital registration data covering England and Wales (the ONS LS), Scotland (Scottish LS) and Northern Ireland (NILS), managed in three separate statistical offices. The aim of this project is to develop methods and procedures for carrying out parallel and if possible combined analyses of these three studies. To do this, we use an exemplar research project to analyse the association between self reported health and long-term illness in the 2001 Census and subsequent mortality in all three studies. Our research objectives are first to analyse socio-economic and socio-demographic variations in reporting long limiting long-term illness and self rated health in 2001 within England & Wales, Scotland and Northern Ireland; and second to analyse associations between reporting of health status in 2001 and subsequent mortality (taking account of socio-economic and socio-demographic factors). This is an important issue as there are some suggestions of differing associations in constituent parts of the UK. Here, we report on our progress in carrying out parallel analyses, using the research project as an example to illustrate this.

Socio-demographic determinants of entry into and exit from long-term institutional care – a linked register based follow-up study of older Finns
Pekka Martikainen, University of Helsinki

Full abstract (Word)

For most elderly people long-term institutional care is often the last resort when physical and mental functioning fails. Furthermore, publicly funded long-term care takes up, on average, about one percent of GDP in OECD countries, and this may more than double by 2050. Thus, in terms of considering future policy choices and estimating long-term care costs, it is important to understand the determinants of long-term care. The aim of this study is to assess the effects of socio-demographic determinants on entry, exit and length of stay from long-term institutional care.

 
Day two: Tuesday 2 September 2008
 

Migration

Using IPUMS data from the 1999 Kenya Census to explore internal migration
David Owen, University of Warwick

Full abstract (Word)

Though it has been one of the most politically stable countries in Africa since achieving independence in the 1960s, Kenya faces substantial challenges in achieving the economic development necessary to lift its rapidly growing population out of poverty. There are substantial geographical differentials in economic development and levels of income and poverty within the country.


While sample labour force, health and poverty surveys are regularly conducted in Kenya, the most recent comprehensive data on the population is still the 1999 Population and Housing Census (preparations are currently being made for the 2009 Census). A five per cent microdata sample from the Census is made available via the IPUMS data archive at Minnesota.


This paper makes use of the IPUMS microdata for Kenya to explore patterns of internal migration within the country and the associations between migration and poverty differentials. The dataset provides information on demographic and economic characteristics of individuals within households, and also records an individual’s district of birth, district one year before the Census and length of residence at their current address.


A Gender Approach to Cultural Norms and Labour Migration: The Case of Migration from Romania
Raluca Prelipceanu, The Sorbonne Centre for Economics, University of Paris

Full abstract (Word)

Our paper analyses the determinants of the Romanian labour migration during the transition period. Our approach is comparative taking into account the differences between the migration decision of men and that of women. Our main research questions are “Do migration determinants differ according to gender?” and “Do local norms influence the propensity to migrate of women and that of men?”. In order to answer these questions, we build first of all a collective household model of the decision to migrate. We consider the household as being made of two members: the husband and the wife. The wife works on the labour market, but is also involved in the production of a domestic public good, in this case children, whereas the husband works only on the labour market. We consider the market for childcare to be incomplete so the household cannot employ another person to take care of the children. However, there is imperfect substituability between family members inside the household. As children are part of the utility function of both the husband and the wife, a decrease in the utility function of the children would also induce a decrease in their parents’ utility functions. Due to differences in the domestic labour supply, the migration cost in the case of women exceeds the migration cost of men. We consider this to be one of the main explanations for the low mobility of women. In addition, the migration decision is also influenced by the sharing rule which, in turn depends on the income of the family members and on distribution factors like gender and cultural norms.

Migration in National Surveys and Censuses
Adriana Castaldo and Gunjan Sondhi, University of Sussex

Full abstract (Word)

In recent years, as part of the overall increasing importance that the topic of migration - from and within developing countries - acquired globally, it has been amply acknowledged that improving the availability and the quality of data on migration flows and the characteristics of worldwide migrants is key to the achievement of an informed and sound design of migration policies. This applies both in the areas of the migration of adult individuals, as well as children. In an attempt to fill in the existing gap on the availability and use of quantitative data on adult and child migration, we have developed a unique catalogue of existing national-level household surveys and censuses in developing countries. This collection of household surveys and censuses contains information on and access to data that can be used to analyse child migration and forms an integral part of the catalogue “Migration in National Surveys” (MiNS).

This paper describes the MiNS catalogue and uses part of the data collected in the catalogue as the main tool to investigate how the theme of migration and specifically child migration is treated in some of the censuses; how this data has been and could be used in the study of child migration; and to address issues of data quality and access to data.

Using the 2001 Census Individual CAMS to explore the relationship between migration and longer distance commuting in rural England
Tony Champion, University of Newcastle

Full abstract (Word)

For some time there have been concerns about the effect of in-migration on the life of rural communities and on the sustainability of rural settlement more generally. Particular issues have been raised by the movement of people out from cities who continue to work there and therefore have long work journeys. Yet the evidence base on the links between migration and commuting has traditionally been weak, partly due to the dearth of large-sample datasets covering both of these behaviours in any detail. Even with the Population Census, it is only with the advent of the SARs that – short of obtaining commissioned tables – it has been possible to crosstabulate people on the basis of migrant status and journey-to-work pattern.

The research results reported in this paper take advantage of the richness of the data available in the Individual version of the Controlled Access Microdata Sample (CAMS). With its 3.125% sample of England’s residents, it contains derived variables on distance to work and on distance between current usual residence and that of one year ago. It also gives the local authority of both current and previous address, allowing a classification of migrants by the types of place moved from and to. Along with the detailed breakdown of all the personal characteristics collected by the Census, this makes it a powerful source for answering the study’s central question about how much the odds of being a longer distance commuter are affected by being a recent migrant and by the type of move made.

 

Health and Employment

South Asian ethno-religious groups and part time employment pattern in England and Wales: Results from the 2001 individual SARs
Reza Afkhami, University of Manchester

Full abstract (Word)

In recent years it has become clear that some minorities face larger penalties than others in the labour market (Craig, et al. 2005; Dale, et al 2006, 2002; Fieldhouse and Gould, 1998). It is also known that the nature and perhaps the extent of discrimination faced by ethnic minorities in Britain vary between different ethnic minority groups mainly Muslim south Asian, Modood (1997). A recent work by Simpson et al (2006) indicates a further research needed to investigate high rates of part-time working among some populations including Pakistani and Bangladeshi men. This is particularly important as Part-time working may imply lower incomes, poorer working conditions and barriers to full-time work.


This paper analyzed part time working among south Asian population in England and Wales. We tested whether south Asian ethno-religious composition can explain the different patterns of part time working in England and Wales.

Measuring Social Progress: Labour market position of 1st and 2nd generation minority ethnic groups in Britain (1991-2001)
Yaojun Li, University of Manchester

Full abstract (Word)

Ever since the 1991 Census, particularly the advent of the samples of anonymised records (SARs) at the individual and household levels with the ethnic data available, there has been a huge interest in the economic fates of the minority ethnic groups in Britain. Many studies were conducted as can be seen in the collection by Karn (1997, for a summary of the key findings and research potentials of the SARs, see Li 2004). This interest has also influenced the collection and analysis of data, particularly social surveys, by other government agencies and academic community. Indeed, there has been much more academic and policy interest in ethnicity than in the traditional mainstream sociological issues such as mobility and gender inequality. Yet, it is also the case that little systematic research is available on the social progress, or lack of it, in terms of the labour market position by first and second generation minority groups in Britain across the decade using standardised variables and methods.

This paper will use the 1991 and the 2001 SARs to address the issue. We shall use standardised variables at the individual level on ethno-generational status, labour market position in terms of employment and occupational (class) status, and a range of other personal attributes and human capital variables such as age, education and marital status as available in the SARs. We shall also construct and add new variables on the socio-economic development (GDP/GVA) and the ethnic diversity at the regional level at both time points to serve as contextual factors. The two sets of variables will enable us to conduct a systematic research on trends of ethnic penalty in British society over the period covered.

The Geography of Unpaid Caring in the UK
Paul Norman, University of Leeds and Kingsley Purdam, University of Manchester

A person is a provider of unpaid care if they give any help or support to family members, friends, neighbours or others because of long term physical or mental health or disability, or problems related to old age. It is estimated that there are over 6 million unpaid carers in the UK of whom 1.7 million provide over 20 hours of care per week. The number of ‘informal’ carers may need to double within a generation due to the ageing of the UK population. In this paper we examine geographic and social variations in the amount of unpaid caring across the UK. Using the Samples of Anonymised Records from the 2001 Census we map sub-national geographic variations in the amount of caring in the UK. We consider variations in caring at the individual level using a specially devised UK index of caring. The findings have important implications for our understanding of the dynamics of caring and for service providers at a national and local level.

Uses of census microdata for monitoring human resources for health in low and middle income countries
Neeru Gupta, World Health Organization, Switzerland

Full abstract (PDF)

Human resources are a strategic capital in any organization, but especially so in health and other service organizations which are highly dependent on their workforce. Despite the undoubted importance of monitoring human resources for health (HRH) and the impacts on health systems
performance and population health outcomes, in many countries the empirical evidence to support policy formulation is often fragmented and incomplete. Many sources that can potentially produce statistics on the health workforce remain underused. In particular, the potential of population censuses for HRH monitoring has generally not been met, especially in developing countries where access to microdata has generally been limited and comparisons across countries or time periods are difficult because of inconsistencies in both data and documentation.

The World Health Organization's Department of Human Resources for Health works to strengthen the global evidence base on HRH through the building of capacity for analysis and use of data from standard statistical sources, including census sources. This paper has three main components. First, we discuss the relative strengths and limitations of census data for health workforce analysis. Second, we present findings from recent research on HRH stock and distribution using microdata from the Integrated Public Use Microdata Series (IPUMS)-International databank for the 2000 round of censuses for selected low and middle income countries. Lastly, we explore opportunities for strengthening global health workforce analyses in the upcoming 2010 round of censuses.

 

International Migration

Status, class and immigration in the 1901 Census: results of applying 19th Century and 20th Century classifications to occupational data on immigrants and the native-born in the 1901 Census of Canada
Charles Jones, University of Toronto

Full abstract (Word)

Sociologists and Historians agree that occupation is an important indicator of an adult’s place in the social stratification system. However they have quite different views about whether stratification is most usefully defined as a continuous socioeconomic gradient or as a set of discrete social classes. Blau and Duncan’s seminal work, The American Occupational Structure uses both categorical and continuous approaches but more recent North American empirical research has mostly followed Duncan’s approach in seeking and using “a socio-economic index for all occupations” (Duncan’s SEI). This is seen most clearly in the IPUMS project where Ruggles and other members of the Minnesota group have used the 1950 US Census classification and SEI in order to code occupation data in all US Censuses from 1850 to the present. More recent non-US censuses and surveys have used the International Standard Classification of Occupations and an international socioeconomic index scale. Taking another direction there are now several 19th Century Census data files from countries bordering the North Atlantic where the occupational information has been coded according to the North Atlantic Population Project’s modifications of the more social-class oriented Historical Standard Classification of Occupations (HISCO): a classification that is more appropriate to agrarian societies with pre-modern social relations of production and large numbers of domestic service workers.

Family Formation of ‘Foreign Brides’ in Japan and Taiwan: A Comparative Analysis of Two Types of Censuses
Hiroshi Kojima, Waseda University, Tokyo

Full abstract (Word)

A Rapid increase in international marriages and the social integration of foreign female spouses (wives) and their children became a social issue in Taiwan. Consequently, the Taiwan Ministry of the Interior conducted the Census on Living Conditions of Foreign and Mainland Spouses in 2003 with all the foreign and Mainland spouses of Taiwanese nationals. In Japan the integration of foreign wives and their children became an issue earlier than in Taiwan, but the support measures were largely left to local governments and NGOs and the information is largely based on small-scale surveys. In fact, the Population Census is the only source of nationwide information on couples in international marriage in Japan. The purpose of this study is to clarify the determinants of family formation behaviors of foreign wives in international marriage in Japan and Taiwan to supplement insufficient information and to derive implications for Japanese official statistics, drawing on microdata from the 2000 Population Census of Japan and the 2003 Taiwanese census of foreign spouses.

Gender Ratios in Global Migrations, 1850-2000
Johanna Leinonen, University of Minnesota

Full abstract (PDF)

Scholars in many disciplines have observed that men historically far outnumbered women among international movers. Two of geographer E.G. Ravenstein's latenineteenth century "laws of migration" asserted that (1) short-distance migrants generally outnumbered longer distance ones, and that (2) within-country moves were usually dominated by women and between-country moves were dominated by men. Although many geographers have critiqued and extended Ravenstein's work, these two oftenrepeated laws have rarely been challenged since they were written in the late 1800s. Textbooks in demography and world history presented these theories as conventional
wisdom for much of the twentieth century (e.g., Peterson 1969: 264; United Nations 1979: 4; Manning 2005: 11).

In recent years, researchers at the U.S. Department of Labor and elsewhere have pointed toward a "remarkable shift" in migrant gender ratios, from women constituting less than one-third of all U.S.-bound migrants in 1900 to almost one-half in the 1970s (Houston et al. 1984: quote on 913, Simon and Brettell 1986). The United Nations' 2006 State of World Population Report advised policy-makers to take note of the fact that "today, women constitute almost half of all international migrants worldwide" (United Nations 2006). The U.N. report outlines a broad range of policy issues—such as social services, labor market policies, and migration regulations in sending and receiving countries—that are affected by this apparent shift in the gender composition of migration
streams.

 

Combining data

Combining 2001 UK Census data in a multilevel model
Mark Tranmer, University of Manchester

I explain how the research value of Census data is maximised by combining census aggregate and micro-data in a multilevel modelling framework, and illustrate this approach given the data availability from the 2001 UK census. I show how multilevel model parameters may be estimated, even though the available Census data are not of standard multilevel form. Moreover, I explain the important role of micro-data in this estimation process. Finally, I provide some empirical results based on an analysis of the 2001 census data, and simulations based on the 2001 data structure. These results suggest that there it is highly beneficial to combine aggregate and micro data in this situation, rather than working with a single source such as the census aggregate data.

These methods will benefit Census data users as well as having more general impacts in social statistics and human geography. The methodological developments are also relevant to many other situations where multiple sources of data for the same population are combined, with implications for design, analysis, confidentiality protection and data-release.

Using Targeted Perturbation of Microdata to Protect Against Intelligent Linkage
Mark Elliot, University of Manchester

Full paper (Word)

This paper describes linkage experiments using the 2001 individual Sample of Anonymised Records (SARs) from the UK census to the microdata output from the UK Labour Force Survey (LFS) for spring 2001.

The objective of the study was to assess the impact of the statistical disclosure control methods on the used on the 2001 SARs on the ability to link an external dataset and SARs. The Labour Force Survey was selected as the external file because (i) it was of sufficient size to produce a large enough overlap with the SARs and was collected around the census date.

The project follows the tradition of other such studies with official data; e.g. Muller, W; Blien, U.; and Wirth, H. (1992), Elliot and Dale (1998). However, the study here elaborates on that earlier work by examining the impact of a targeted disclosure control technique on the ability of an intruder to attack a dataset by focusing on the high risk records.

 

Family formation

The Mixed-Ethnic Unions in England and Wales in the 1990s

Zhiqiang Feng, University of St Andrews

Full abstract (Word)

Although developed societies are becoming increasingly ethnically diverse, relatively little research has been conducted on mixed-ethnic unions (married or cohabiting). Those studies which have been undertaken demonstrate that mixed-ethnic unions account for a small fraction of total couples, but they are growing in number. In England and Wales, the number of mixed-ethnic unions increased by 65 percent between 1991 and 2001. Previous studies in Britain tended to focus on a general picture of the number and trends of mixed-ethnic unions (Berrington, 1996, Coleman, 2004). One exception is the study by Muttarak (2004) which uses Labour Force Survey (LFS) to analyse the factors contributing the formation of mixed-ethnic unions. Our study uses data from 1991 and 2001 Household Samples of Anonymised Records (Household SARs) which are based on a 1 percent sample of the total households in England and Wales. The Household SARs provide information on both partners in the family, which allows us to identify mixed-ethnic unions and the large sample size allows us to investigate mixed-ethnic unions by individual ethnic group. In the study we document the growth of mixed-ethnic unions by ethnic group in the 1990s. We also examine whether there are differences between mixed- and co-ethnic unions in terms of demographic and socio-economic characteristics, focusing particularly on age, generation, education, and social class. Furthermore, we use log-linear models to investigate whether the propensity of out-partnering for different ethnic groups has changed between 1991 and 2001.

Studying links between disability and family formation: Evidence from the SARS and other data sources
Stephen McKay, University of Birmingham

Full abstract (Word)

There is mixed research evidence on the links between family formation and disability. Some studies of specific illnesses and health conditions have found strong links, with disability-related conditions increasing the risk of family breakdown, whilst other studies have found no link. Few such studies have been conducted in the UK. The other main limitations of UK-based research has been relatively small sample sizes, concerns about the appropriate measurement of disability, and especially over-reliance on a medicalised model of disability, and the lack of an overall structure in which to conceptualise family transitions.


The study examined:


1. Family breakdown – that is, splitting up among both married and cohabiting couples.
2. The rate of single people becoming couples (‘partnering’).
3. Effects on family size, through any changes in the timing of births and whether they happen at all.

 

Poverty Mapping

Micro-level and Macro-level Effects of Family Poverty in the Texas Borderland and the Lower Mississippi Delta: United States, 2006
Dudley L. Poston, Texas A&M University

Full abstract (Word)

The Texas Borderland and the Lower Mississippi Delta are the two poorest regions in the United States (see Figure 1 below). Each is characterized by high concentrations of racial and ethnic minorities, Mexican Americans in the Borderland and African Americans in the Delta. The high and persistent poverty suffered by the residents of these regions has much to do with the disadvantaged socioeconomic position of racial and ethnic minorities in the United States.
This paper uses microdata from the 2006 American Community Survey for families residing in the Borderland and Delta. We first estimate the log odds that a family is poor. We use three different definitions of family poverty: 1) whether or not the family has annual money income that is 50% or less of the official poverty threshold (“in deep poverty”), 2) whether or not the family has annual income that is 100% or less of the official poverty threshold (“in poverty”); and 3) whether or not the family has annual money income that is 150% or less of the official poverty threshold (“near poverty/in poverty”).


Census microdata and poverty mapping – a validation study using Brazilian data
Pedro Luis do Nascimento Silva, University of Southampton

Full abstract (Word)

‘Poverty mapping’ has attracted widespread attention in recent years. This is an approach to the estimation of poverty indicators at small area level developed in response to the need for targeting public policies to alleviate poverty. An approach which is currently advocated by the World Bank and has been used in several countries is based on combining high quality information on income (or expenditure) collected using sample surveys designed to estimate for national or broad-level geographies, with detailed information (microdata) provided by a recent census of population (Elbers, Lanjouw and Lanjouw, 2002 – abbreviated ELL). The idea is to fit a model to explain income (or expenditure) using the sample survey data, with predictors which are also measured on the census. Then the model is used to perform ‘mass random imputation’, namely imputing income (or expenditure) for each census record (household). The imputed values are used to calculate the target poverty indicators at the level of the small areas of interest. Multiple replicates of the imputed census file are used to average the poverty indicators and to obtain corresponding standard error estimates.

A Technique for Small Geographical Scale Poverty Analyses: The use of Census Microdata in the Case of Liberia, Costa Rica
Carlos de la Espriella, Lund University, Sweden

Full abstract (Word)

Poverty analyses are instrumental for a pro-poor urban planning practice, but they seem weak in addressing the causes of poverty at city level in developing countries, because they are seldom articulated in a strategic set of pro-poor local actions, and they do not reach small geographical scales at city level where actions should be implemented.


In an attempt to bridge this methodological gap, a technique for undertaking poverty analyses at city level was tested in the analysis of the implementation of Costa Rica’s social housing policy in the medium-sized city of Liberia. The analysis used 2000 National Population Census data of the city of Liberia, desegregated at small geographical units .

 
Balancing access and confidentiality
 

The Research Data Centre Program: from a seed to a forest. A fundamental element of the social research infrastructure in Canada

Gustave Goldmann, Statistics Canada

Informed decision making on social issues requires current, comprehensive and very well-targeted research. Societies face two primary challenges in order to respond to this need for timely information – access to relevant data and a corps of qualified researchers to conduct the analyses. As part of a response to the challenges that confront Canadian policy research, a network of Research Data Centres was formally launched in December 2000. There are currently 25 Research Data Centres and Branches located throughout the country, so researchers are not obliged to travel to Ottawa to access Statistics Canada data. At the same time, the centres are administered in accordance with all the confidentiality rules required under the Statistics Act. The Research Data Centres meet, in a single location, both the need to facilitate access to detailed micro-data for crucial social research and the need to protect the confidentiality and security of Canadians’ information. The research conducted in the centres contributes significantly to the public debate on key issues that are of concern to Canadian society. The network expands the collaboration between Statistics Canada, the Social Sciences and Humanities Research Council, The Canadian Institutes for Health Research, universities and academic researchers. It is also instrumental in training a next generation of Canadian quantitative social scientists. The Canadian RDC Network serves as a model for achieving a balance between providing open access for research and respecting the confidentiality of official data.

Improving access to Census Microdata - the Australian experience

Jenny Telford, Australian Bureau of Statistics

With the increasing pressure to release more detailed data to clients, protecting the confidentiality of respondents has never been more important than now.

The Australian Bureau of Statistics, like statistical agencies around the world, is committed to protecting the privacy of our respondents and preventing any form of unit level identification. This presentation will focus on the methods being developed by the ABS to improve the way clients access and interrogate microdata while balancing the risk of disclosure with procedural and technical protections. We will outline the advances made through the 2006 Census output program with the development of TableBuilder and provide an overview of the Remote Access Data Laboratory.

Microdata access and confidentiality issues in Germany

Heike Wirth, ZUMA, Germany

Full abstract (Word)

The focus of this presentation is on the possibilities of microdata access in Germany, which might be slightly different to other countries. Along the lines “chi va piano va sano” the microdata access has improved slowly but continuously in Germany over the last two decades. The most important improvements are reflected in (1) an increase of producers of official microdata making their data available for research purposes; (2) a standardization of data access; (3) different types of data access. Microdata can be used in terms of (a) scientific use files (off site access); (b) on site access; (c) remote access. Regarding the flexibility of data analysis (in terms of e.g. time, costs and keeping track of research ideas) scientific use files are the type of microdata access preferred by most researchers. Scientific use files are microdata which are anonymized according to the principle of ‘reasonable anonymity’ (faktische Anonymität) and are released for academic research only. ‘Reasonable anonymity’ means that the reidentification of respondents is not impossible (‘absolute anonymity’) but comes along with a disproportionate amount of time and costs.

 
Day three: Wednesday 3 September 2008
 

Census futures

Routes to access: what affects the decision that statistical offices make on research access
Paul Jackson, Jane Longhurst and Judy Hawkins, Office for National Statistics, UK

The primary debate is over. With suitable safeguards, it is appropriate for a statistics office to provide research access to confidential data. For ONS, this is confirmed by features of the Statistics and Registration Service Act. This presentation will look at the real-world factors that have determined the routes to confidential ONS data currently on offer. It will also look at the pinch points that will shape future routes to access, and how the research community can help with addressing them.

The Danish system for researchers access to microdata

Ivan Thaulow, Danmarks Statistik

Full abstract (Word)

The access of researchers to Danish microdata as well as the quality of the available administrative data in Denmark has improved significantly in the recent years. Through the Research Services at Statistic Denmark researchers have now access to a large number of administrative registers in many fields of research such as labour market research, sociology, epidemiology and business economics.

In Denmark researchers are only given access to de-identified microdata. However, the strength of the Danish system is that we use almost universal identification keys (such as “person numbers” and “central business register numbers”) in most of the administrative registers. Thus it is possible for Statistic Denmark to correlate the aggregated data before data is de-identified and made readable to the researcher. Data can be correlated and aggregated both within a specific year and longitudinally across several years. This creates unique possibilities to combine knowledge over time from different administrative registers in order to be used in social and medical research.

The UK Census: future directions
Peter Fullerton, Office for National Statistics

The decennial UK Census is one of the most important statistical data sources in the UK. Apart from its wide use in government, academia and the commercial sector it is the key data source to measure the number of people living in the UK, the households they live in, and their key individual characteristics.

Although it is recognised the Census is an important data source, there have been recent developments in the population which make it more difficult for a Census to provide up-to-date measures. Because of the increase in migration (both internal and external) a decennial measurement of the population is seen as insufficient and it poses some serious challenges to government in the way it sets policies and allocates local funding for services. This gap is increasingly being recognised by a number of independent observers and has been the subject of criticism by the Parliamentary Treasury Committee.

In responding to this criticism the Office for National Statistics recognises the need for improvements in the way we measure the population. In looking at alternative approaches it is clear that there are many data sources available which, if used in combination, could provide the potential to improve measurement of a mobile population at small area levels. The presentation will set out some of the key issues and will consider advantages and disadvantages of alternative approaches to count the population and its characteristics.

French census microdata: changes ahead

Jean-Francois Royer, Institut National de la Statistique et des Études Économiques (Insee), France

Full abstract (Word)

The last traditional census in France took place in 1999. The micro data sets available for researchers from that operation have been limited: first by the confidentiality rules of the French statistical law (“loi de 1951”) which does not contain exceptions for research as far as personal data are concerned; second, by the supplementary rules adopted on the recommendation of the “National commission for data processing and liberties (CNIL)” in order to prevent the risks of “profiling”. These rules define some information as “sensitive” (nationality, country of origin, date of first entrance to France) and put thresholds on the size of the geographical units that can be described by micro data files; these thresholds vary according to the nature of variables contained (sensitive or not, detailed or not) and according to the sampling rate. For example a 1/20th dataset has been put in public use, with almost all the variables (but not the sensitive ones, nor the census district); a 1/4 dataset has been created for research, with all the variables except detailed geographical ones (under the “region” code), etc. Special provision has been made for local authorities: they can receive more detailed information than ordinary users. For researchers who need more information than the public one, the only way to process census data is to ask INSEE for customized tables (this is a paying service, through a dedicated laboratory “Centre Quetelet”). Special mention can be made of two derived files: “SAPHIR” and “EDP”. The first (SAPHIR) is a collection of micro-data of several French censuses (from 1962 to 1999) with coordinated codes. The second (EDP : “échantillon démographique permanent”) is a 1/100 sample of census and register data collected and matched for the same people (panel). These files have been kept inside INSEE for confidentiality reasons, and only researchers working with INSEE have been able to use them.

The Census Public Use Microdata File (PUMF) at Statistics Canada
Sri Kanagarajah, Census Operations Division, Statistics Canada

Full abstract (Word)

From 1971 to 2001, Statistics Canada have been producing three single files with no relation between the files: Individuals; Families; Household and dwelling. After consultations with users (academic, private sector and federal department), they promoted a new approach for 2006 PUMF. They would like to create one single file for individuals (size: 2.7% of the population) and one hierarchical file (size: 1 % of the population) with individuals, family and household and a link between the three universes. This allows users to link and study relationships between individuals and their families or households as well as allows international comparison.

 
ESRC Contact SARs Support | CCSR
These pages are maintained by the SARs support team.
Send us comments on this web page.