Projects
Explore further details on our ongoing projects for a deeper understanding of our current initiatives.
Current projects
SPRITE+ brings together people involved in research, practice, and policy with a focus on digital contexts. We are a 'one stop shop' for engagement between academic and non-academic communities - a way for these communities to connect and a platform for building collaborations across the spectrum of issues relating to security, privacy, identity and trust.
We are funded until 31 August 2027 under the Engineering and Physical Science Research Council (EPSRC) Digital Security and Resilience Theme (grant reference EP/W020408/1).
SPRITE+ is led by a consortium of five universities: University of Manchester (lead institution), Imperial College London, Lancaster University, Queen’s University Belfast, and University of Southampton. Our Management Team is supported by a multi-disciplinary group of more than 190 Expert Fellows and 24 Project Partners from different sectors, including industry, government, law enforcement, and civil society.
More info can be found on the SPRITE+ website.
DIGISURVOR is an innovative project that provides vital knowledge that enables a transformational shift in how Digital Footprint Data can be anonymised and linked to survey responses, thereby opening it up for ethically compliant secondary analysis.
The project is inter-disciplinary and brings together expertise from political science, social statistics and computer science. It will proceed in three main phases.
DIGISURVOR will make a significant substantive, methodological and ethical contribution to the use of linked survey and digital footprint data within academia and the wider survey research practitioner and user community. More generally, the project will contribute directly to furthering the ESRC Digital Footprints research agenda in terms of building the foundations for rigorous, ethical and societally relevant future research using DFD, and widening access to this new resource across the research community.
The UK Data Service is the principal repository for economic, population, and social research data in the UK. As hosts of the largest trusted digital archive of its kind, our expertise in the collection, preservation, and dissemination of quality data is the culmination of nearly sixty years of sustained investment by the Economic and Social Research Council (ESRC) in the UK’s research data infrastructure.
Meticulously curated over this period from trusted data providers including governments, the Office for National Statistics, the ESRC, and diverse other funders and data owners, our digital collection comprises nationally and internationally significant datasets such as the Census, Understanding Society, the UK Cohort Studies, the Labour Force Surveys, the Family Resources Surveys and many others.
Pioneers in data curation, data literacy and actively managing long-term access to high quality data, our expertise continues to transform social science research, teaching and learning.
More information can be found on the UKDS website.
The Survey Data Collection Methods Collaboration (SDCMC) is a response to post pandemic challenges and aims to deliver a step change in approaches to collecting population survey data in the UK to ensure that it will remain possible to carry out high quality social surveys of the kinds required by the public and academic sectors to monitor and understand society, and to provide an evidence base for policy. It does this primarily through a rigorous programme of research focused on ensuring large-scale social surveys in the UK can innovate and adapt in a changing environment and continue to deliver high quality and inclusive data.
The primary aim of the programme of work is to assess the quality implications of the most important survey design choices relevant to future UK surveys and provide good practice guidance and practical training materials, while a secondary aim is to identify promising ways to improve the capacity and skillset of both interviewers and research professionals and take steps towards making those improvements.
The SDCMC will generate a range of research and training outputs and will engage in a programme of dissemination and promotion activities. Outputs will have a strong practical orientation, consisting of good practice guidance for survey design, survey implementation, survey commissioners and survey data users, all backed up by rigorous and well-documented research and with a range of associated activities to ensure that the lessons are disseminated to all relevant stakeholders and, where appropriate, embedded in institutional practice in a timely manner. The project will also seek to enable a whole community dialogue and collaborative response to wider strategic challenges and issues, as well as incorporating a strong training and capacity building component.
This project aims to build capacity within the international data services community, by providing upskilling opportunities for UKDS staff and developing foundational level data skills modules in computational social science for the wider global community.
It establishes a community of practice to provide enhanced support to users through the lifetime of the project and beyond. Through upskilling mechanisms, this project will enhance data services capacity both in the UK and globally, enhance the careers of data service professionals, and through the establishment of a Community of Practice will contribute to a culture of lifelong learning.
This project explores the capacities of recently advanced network modelling methods in testing core social
theories: symbolic interactionism and social constructivism. and will provide the first comprehensive
quantitative test of such theories.
These new methods enable the disentangling of the complex structures that relate persons, words, and material objects in small groups of collocated individuals. This, in turn, enables cross-validation of the fundamental assumptions the two theories have regarding how cultural meanings are formed across different types of interaction, structural levels, and time spans.
The project utilised a globally unique multi-dimensional dataset that captures socio-cultural dynamics in five groups over two years. Development of statistical models for the analysis of how cultural meaning is created in society will constitute a major contribution to social science. Results of the testing of core social theories have the potential to trigger fundamental changes in social sciences and beyond.
The British Election Study (BES) is one of the longest-running election studies worldwide and the longest-running social science survey in the UK.
It has made a major contribution to the understanding of political attitudes and behaviour over nearly sixty years.
Surveys have taken place immediately after every general election since 1964.
The first study conducted by David Butler and Donald Stokes in 1964, transformed the study of electoral behaviour in the UK.
Since then the BES has provided data to help researchers understand changing patterns of party support and election outcomes.
Past projects
Immigration is a major political issue, with increasing media coverage, rising anti-immigration sentiment, and the rise of anti-immigration political parties.
The issue of migration sits centrally within the wider debate about ethnic and religious diversity and its effects on social cohesion.
We are still, though, a long way from understanding these issues and their potential consequences.
They seem to rest on beliefs about national identity and ethnicity, but cannot be divorced from the effects of social class, education, economic competition, and inequality, as well as the influences of geographical and social segregation, social structures, and institutions.
This project will integrate two very different disciplines, social science, and complexity science, to gain a new understanding of these complex, social issues.
It will do this by building a series of computer simulation models of these social processes.
One could think of these as serious versions of the Sims computer games, programmes that track the social interactions between many individuals.
Such simulations allow ‘what if’ experiments to be performed so that a deeper understanding of the possible outcomes for society as a whole can be established based on the interactions of many individuals.
BADEN funded by The Leverhulme Trust gathers researchers from academia and national statistical offices and gives a strong impetus to theory development and practical implementation of adaptive survey designs.
Adaptive survey designs differentiate features for different population subgroups based on auxiliary data about the sample obtained from frame data, registry data, or paradata.
The development of a Bayesian framework will allow the learning and constant updating of key input parameters to these designs. More specifically, our objectives are as follows.
- To bring together researchers periodically and to speed up theoretical development and practical implementations of adaptive survey designs.
- To establish a cross-institute research agenda for the main research aim.
- To design and implement joint simulation studies.
- To support discussion on theory and the exchange of empirical results.
- To disseminate work to a larger public to advocate ideas and to get feedback on feasibility and utility.
- To lay the groundwork for joint papers and other forms of collaboration.
- To assist implementation of adaptive survey designs.
People
- Prof Natalie Shlomo - Principal Investigator (PI)
- Dr Stephanie Coffey - US Census Bureau
- Dr Gabriele Durrant - University of Southampton
- Dr Peter Lundquist - Statistics Sweden
- Mr Daniel Pratt - RTI International, North Carolina
- Dr Barry Schouten - Statistics Netherlands
- Dr James Wagner - University of Michigan
- Ms Rebecca Moore - Network Facilitator
CoDE is an interdisciplinary programme of research concerned with understanding changing ethnic inequalities and identities.
Our team has more than 20 academics, many affiliate members, and PhD students working from Glasgow, Oxford, and Manchester. We also have many valued partners.
CoDE utilises a variety of research techniques and tools to ensure that the potential economic and social benefits of our research are realised.
Our focus is on the changes within ethnic groups (their internal structures and formulations of identities) and their external relationships and position in British society. Bringing together sociologists, demographers, historians, geographers, and political scientists, we are researching:
- how class, gender, generation, age, and place produce different experiences and visions of ethnicity across the UK;
- how changes in ethnic identities over time were expressed through the emergence of new or mixed identities, as well as the shifting significance of language and religion as a marker of ethnicity;
- the significance of the context of emigration and arrival in shaping ethnic identities and the long-term trajectories of migrants in British society;
- how major social changes in Britain’s economic and political structures have impacted the ethnic inequalities experienced in employment and politics today.
Using a longitudinal outlook, the project investigates the outcomes and trajectories of ethnic minority inclusion within schools; the labour market; and civic and political life in Britain and Canada, as well as the role that 'family capital' plays in determining these inclusion patterns.
Social cohesion is perceived as an important social goal within academic and policy circles.
A crucial precondition to this goal is social inclusion, i.e. ensuring that individuals become full members of society by accessing societal resources and institutions.
In recent decades, fast-growing ethnic diversity has led to intensifying policy and academic debates on how this is affecting processes of social inclusion and cohesion.
Examining the factors influencing ethnic inclusion has thus increased in importance.
The project contributes to this debate by exploring the dynamics of ethnic inclusion into the economic and civic-political spheres and the interplay of these two separate but closely connected spheres.
It also investigates how these lifelong processes are shaped by family capital, i.e. a broad and complex range of cultural, social, economic, and political influences that are created within the family context. More specifically, the research addresses four core questions.
- What are the pathways to socio-economic and civic-political inclusion of ethnic minorities into British and Canadian institutions?
- What role does family capital (financial, human, social, and cultural) play?
- What are the best ways to address the differentials in family influences?
- Do different policy environments generate different types of outcomes?
To answer these questions, the study draws on available secondary data sources (including Understanding Society and the Millennium Cohort Study in Britain and the National Longitudinal Survey of Children and Youth and the Survey of Labour and Income Dynamics in Canada) and uses advanced longitudinal and multilevel methods of data analysis.
Missing data due to non-response imposes a serious threat to the quality of surveys and register-based statistics.
Non-response is often not a random phenomenon. It usually depends on demographic and socio-economic characteristics of individuals or enterprises. Also, the data collection process may have a substantial influence.
The response rate is often used as an indicator of survey quality. It has the advantage that it can be easily computed. However, low response rates will not necessarily cause estimates to be biased. There are ample examples in the literature where increased data collection efforts have led to a higher response rate but also to a larger non-response bias.
To assess the effects of non-response on the quality of statistics, other quality indicators are needed. These indicators should measure the degree to which respondents and non-respondents differ from each other. In other words, such indicators should measure the degree to which the group of respondents in a survey or register resembles the population. The indicators are called Representativity Indicators or, for short, R-indicators.
It is the objective of RISQ to develop R-indicators, to explore their characteristics and to show how to implement and use them in a practical data collection environment.
The project will demonstrate that R-indicators are not only used in the analysis of survey data but also during fieldwork. They can be used to monitor data collection processes, and therefore facilitate efficient allocation of interviewing resources.
The 7th Framework Programme
RISQ is financed by the 7th Framework Programme (FP7) of the European Union. It is supported by the "Cooperation Programme". This programme has ten distinct themes. RISQ is part of the theme "Socio-economic sciences and the Humanities", and within this theme of Activity 8.6 ("Socio-economic and scientific indicators"), 8.6.3 ("Provision for underlying official statistics").
Publications
Deliverables
Work Package 2 / Deliverable 1: Schouten, B. (2008), Documentation of datasets
Work Package 6 / Deliverable 5: Loosveldt, G. & Beullens, K.(2009), RISQ - Fieldwork Monitoring
Papers
Presentations
Abstracts
Schouten, B. & Bethlehem, J. (2008), Special Topic Session on Quality Indicators. Q2008, Rome, Italy
Tools
Random generation of auxiliary variables
In Schouten (2015, Discussion paper 2015-15, CBS, Den Haag, The Netherlands) and Schouten (2017, JRSS series B), R-indicators and coefficients of variation (CV) are evaluated under random generation of auxiliary variables. A framework is presented for various variable-generating distributions and the expected amount of explained variation is derived. In the 2017 JRSSB paper, a data set extracted from the CentERdata LISS-panel, see www.lissdata.nl, features as an example. The data set and R code to reproduce the example are available here. The SPSS file contains a codebook.
You can download the following files:
Computation of R-indicators - Version 2.1
An extended version of the RISQ 2 code in SAS and R was released at September 14, 2015. RISQ 2.1 estimates partial CV at the variable-level and at the category level. Furthermore, it approximates standard errors for the overall CV and partial CV. The manual is extended and discusses the new indicators.
You can download the following files:
- The manual RISQ Manual Version 2.1
- The SAS code (R-indicators Version 2.0.(SAS)
- The R code (R-indicators Version 2.0 (R)
Computation of R-indicators - Version 2.0
This is the new version of the RISQ code in SAS and R. It was released in February, 2014. RISQ 2 includes standard error approximations for all indicators, and it also has the coefficient of variation. Furthermore, bias adjustment has slightly changed with respect to version 1 of the code. The use of the functions is explained and illustrated in the manual.
You can download the following files:
- The manual RISQ Manual Version 2
- The SAS code (R-indicators Version 2.0.(SAS)
- The R code (R-indicators Version 2.0 (R)
Computation of R-indicators - Version 1.0
The computation of R-indicators and partial R-indicators are implemented in SAS and R. The code for both SAS and R can be downloaded. The code contains all functions needed to compute:
- R-indicators
- Unconditional partial R-indicators at the variable level and the category level
- Conditional partial R-indicators at the variable level and the category level
- The use of the functions is explained and illustrated in the manual.
You can download the following files:
- The manual (RISQ Deliverable 12.1)
- The SAS code (RISQ_R-indicators_v1.0.sas)
- The R code (RISQ_R-indicators_v1.0.r)
- The SPSS test data set used in the manual (RISQ-test.sav)
The program Cockpit
Cockpit is a software tool that demonstrates the graphical possibilities of survey response analysis with R-indicators.Cockpit can generate in ann interactive way the following plots:
- Box plots of response probabilities for each category of an auxiliary variables
- Bar plots of unconditional R-indicators for sets of auxiliary variables
- Bar plots of unconditional R-indicators for the categories of an auxiliary variable
- Bar plots of conditional R-indicators for sets of auxiliary variables
- Bar plots of conditional R-indicators for the categories of an auxiliary variable
Cockpit requires a data file and a metadata file. Such files can be generated from SPSS or Stata files with R scripts that are included in the package. Also included is a demonstration data set with data form a Statistics Netherlands survey.
It should be noted that Cockpit is just a demonstration tool and not a production tool. No bias correction is carried out in the computations of the various R-indicators.
A zip-file can be downloaded. This contains the following files:
- cockpit.exe (the program itself, non-installation required)
- cockpit.pdf (the cockpit manual)
- gps.rin (sample metadata file)
- gps.dat (sample data file)
- export-spss.r (R-script to export data and metadata from SPSS to Cockpit).
- export-stata.r (R-script to export data and metadata from Stata to Cockpit).