# HMC Community Survey 2021 - Column Mapping Guide

This document maps each column in the `responses_cleaned_mapped_to_publish.csv` dataset to its corresponding question in the HMC Community Survey 2021.

## Survey Overview

The HMC Community Survey 2021 was conducted to understand research data management practices among researchers in the Helmholtz Association. The survey used a **dynamic questioning approach**: follow-up questions were shown based on previous answers, which is why the number of columns varies from section to section.

System-generated fields for survey administration and analysis.

- `id` - Response identifier
- `interviewtime/_` - Interview duration
- `lastpage/_` - Last page reached in survey
- `submitdate/_` - Submission timestamp

## Survey Logic and Adaptive Questioning

The survey implemented **conditional logic**:

- Questions were dynamically adapted to respondents' expertise levels
- Follow-up questions appeared based on previous answers
- Different paths were available for different experience levels
- Not all respondents saw all questions

As a result, many columns are empty for any given respondent. The survey design allowed for 305 possible columns, while the published dataset contains only 263 after data cleaning and anonymization.
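
Because of this conditional logic, an empty cell can mean that a question was never shown rather than skipped. The sketch below counts non-empty answers per column to make that visible. It assumes that unshown or unanswered questions appear as empty strings in the CSV; the sample rows and the `EXAMPLE1/_` and `EXAMPLE2/_` column names are invented for illustration and are not taken from the dataset.

```python
import csv
import io

# Invented sample data: the EXAMPLE columns and values are illustrative only.
SAMPLE = """id,EXAMPLE1/_,EXAMPLE2/_
1,Yes,Weekly
2,No,
3,Yes,Monthly
"""

def response_counts(csv_text):
    """Return the number of non-empty cells for each column."""
    reader = csv.DictReader(io.StringIO(csv_text))
    counts = {name: 0 for name in reader.fieldnames}
    for row in reader:
        for name, value in row.items():
            # Assumption: an empty string means "not shown" or "not answered".
            if value is not None and value.strip():
                counts[name] += 1
    return counts

counts = response_counts(SAMPLE)
for name, count in counts.items():
    print(f"{name}: {count} answers")
```

To run the same count on the published file, the text of `responses_cleaned_mapped_to_publish.csv` could be passed in place of `SAMPLE`, provided the empty-string assumption holds for that export.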
111
+
112
+
## Key Survey Focus Areas
113
+
114
+
The survey particularly focused on understanding:
115
+
116
+
1.**Current practices** in research data management
117
+
2.**Metadata handling** and documentation approaches
118
+
3.**Data publishing behaviors** and motivations
119
+
4.**Support needs** for FAIR data implementation
120
+
5.**Barriers and obstacles** researchers face
121
+
6.**Community-specific requirements** across six Helmholtz research fields

## Research Fields Covered

The survey covered all six Helmholtz research fields:

- Aeronautics, Space, and Transport (AST)
- Earth and Environment (E&E)
- Energy
- Health
- Information
- Matter

## Data Collection Details

- **Survey Period**: September to November 2021
- **Total Responses**: 631 completed responses
- **Implementation**: LimeSurvey platform
- **Data Collection**: Fully anonymized
- **Target Group**: Scientific staff across all Helmholtz research centers

## Data Processing and Column Reduction

The published dataset contains **263 columns** rather than the 305 possible columns in the survey design. Columns were dropped during data processing for the following reasons:

1. **Anonymization**: Institutional affiliation data and other identifying information were removed
2. **Privacy protection**: Software names used by fewer than 4 respondents were anonymized
3. **Data cleaning**: Empty or unused columns may have been filtered out
4. **Conditional questions**: Some survey paths may not have generated responses, resulting in unused columns

The report specifically mentions: "Before the data publication the following information was removed or anonymized from the survey data in order to prevent the identification of individuals: Any information – including that might reveal a respondent's institutional affiliation, Names of software that is used by less than 4 respondents, Any information about institutional repositories."
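
The rule about rarely used software names can be illustrated with a short sketch. It is not the procedure the survey team actually used: the threshold of 4 comes from the text above, while the function name `mask_rare_values`, the placeholder label, and the sample values are assumptions made for illustration.

```python
from collections import Counter

def mask_rare_values(values, min_count=4, placeholder="anonymized"):
    """Replace values that occur fewer than min_count times with a placeholder.

    Empty strings are left untouched so that missing answers stay missing.
    """
    counts = Counter(v for v in values if v)
    return [
        v if (not v or counts[v] >= min_count) else placeholder
        for v in values
    ]

# Hypothetical answers to a "which software do you use" question.
answers = ["Python"] * 5 + ["ToolX"] * 2 + [""]
masked = mask_rare_values(answers)
print(masked)
```

Counting before masking matters here: a name given by exactly 4 respondents is kept, while one given by 3 or fewer is replaced everywhere it occurs.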

## Usage Notes

- Column headers use a hierarchical naming convention (GROUP/SUBQUESTION/OPTION)
- Multiple choice questions have separate columns for each option
- Rating scales and slider questions have numeric values
- Free text responses were cleaned and categorized where applicable
- The `/_` suffix typically indicates single-choice or numeric responses
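
The naming convention above lends itself to a small parsing helper. The following is a minimal sketch, assuming headers are separated by `/` and that a trailing `_` segment is only a marker for single-choice or numeric columns. The helper names `parse_column` and `group_columns` are hypothetical, and `GROUP/SUBQUESTION/OPTION` and `GROUP/SUBQUESTION/OTHER` are placeholder headers, not real columns.

```python
from collections import defaultdict

def parse_column(name):
    """Split a header into parts (assumed layout: GROUP/SUBQUESTION/OPTION)."""
    parts = name.split("/")
    # A trailing "_" segment is treated as the "/_" marker, not as a subquestion.
    single_value = len(parts) > 1 and parts[-1] == "_"
    if single_value:
        parts = parts[:-1]
    return {
        "group": parts[0],
        "subquestion": parts[1] if len(parts) > 1 else None,
        # Any segments beyond the second are kept together as the option.
        "option": "/".join(parts[2:]) or None,
        "single_value": single_value,
    }

def group_columns(names):
    """Collect column headers under their GROUP prefix."""
    grouped = defaultdict(list)
    for name in names:
        grouped[parse_column(name)["group"]].append(name)
    return dict(grouped)

columns = ["id", "interviewtime/_", "GROUP/SUBQUESTION/OPTION", "GROUP/SUBQUESTION/OTHER"]
for column in columns:
    print(column, "->", parse_column(column))
```

Grouping by the first segment is one way to collect the per-option columns of a multiple choice question before analysing them together.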
- ✅ **Color Utility Functions** - Easy-to-use functions for getting HMC colors in charts

---

## ❓ Frequently Asked Questions

### Q: Does the refactored version work exactly the same?
**A:** Yes! All functionality is preserved. Users see no difference, but developers get a much better codebase with official HMC branding.

### Q: Do I need to change deployment scripts?
**A:** No changes needed. The same `panel serve` command works exactly as before.

### Q: Can I still modify the dashboard?
**A:** Yes, and it's now much easier! Check the [Developer Migration Guide](developer-migration-guide.md) for details. Plus, you now have official HMC colors available.

### Q: What about performance?
**A:** No performance impact. The modular structure may even be slightly faster due to better organization.

### Q: How do I add new features?
**A:** Much easier now! Each type of change goes to its specific module. See the [Module Architecture](module-architecture.md) guide.

### Q: How do I use the new HMC colors?
**A:** Import from `hmc_layout.hmc_colordicts`. Colors are applied to charts automatically; for custom visualizations, call `get_hmc_colors(n)`.
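
A minimal usage sketch for a custom visualization follows. The import path and the `get_hmc_colors(n)` call are taken from the answer above; the assumption that it returns a list of `n` color strings is not confirmed by this document. The fallback function and its hex values are generic placeholders so that the snippet also runs outside the repository, and they are not official HMC colors.

```python
try:
    # Import path as given in the FAQ answer.
    from hmc_layout.hmc_colordicts import get_hmc_colors
except ImportError:
    # Stand-in for running outside the repository. These hex codes are
    # generic placeholders, NOT the official HMC palette.
    _PLACEHOLDER_PALETTE = ["#1f77b4", "#ff7f0e", "#2ca02c", "#d62728"]

    def get_hmc_colors(n):
        """Return n placeholder colors, cycling through the palette."""
        return [_PLACEHOLDER_PALETTE[i % len(_PLACEHOLDER_PALETTE)] for i in range(n)]

# Assumption: get_hmc_colors(n) returns a sequence of n color strings.
categories = ["Energy", "Health", "Matter"]
colors = get_hmc_colors(len(categories))

# Pair each category with a color, e.g. to pass to a plotting library.
color_by_category = dict(zip(categories, colors))
print(color_by_category)
```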