Inhaltsverzeichnis

Additional Variables in the Data Set

There are additional variables before (left) and after (right) of your question's variables. This chapter will shortly describe their meanings.

Note: Some variables must explicitly be enabled before starting the download.

Note: For privacy reasons, recording data from the user's browser (browser, referer, IP address, etc.) needs to be to activated before collecting data.

Interview Identification

Interview Progress

These variables are placed at the data set's end.

Completion Times

Translated with DeepL.com (free version)

Note: The parameters TIME_SUM and TIME_RSI only contain a value if the downloaded data set contains at least 10 records for the respective questionnaire (selection_criteria_filter). The more records the download contains, the more accurate the values for TIME_SUM and TIME_RSI will be, because the distribution of response times in the sample is used to clean outliers or to normalize them.

Note: The response times are only included in the data set if the option to download the dwell times has been checked the variables selection of the download options. This option is checked by default.

Note: Processing times are recorded automatically. To deactivate the recording, please uncheck the option in Survey ProjectProject Settings → tab Privacyrecord time and duration during the survey.

Quality Indicators

Data quality in online surveys is usually quite good. Data cleaning, however, is necessary in mostly every survey. When using the option Variables selectionDownload data quality parameters SoSci Survey provides variables to support data cleaning:

The variables LASTPAGE and FINISHED can be used to determine whether a questionnaire has been completed in full (see above). The proportion of missing information (MISSREL) is a valuable indicator of the diligence of the participant or for data sets that originate from “just having a look”. Although the time invested in completing the questionnaire is not a direct indicator of data quality, very low response times (low TIME_SUM and high TIME_RSI) indicate that the questions were not even read.