ARC Variable Naming Conventions¶
This document describes the variable naming conventions used in ARC, which support consistent structure, clarity, and integration with data collection and harmonization tools like ISARIC BRIDGE and ISARIC VERTEX.
1. General Naming Structure¶
Variable names follow a modular convention using underscores (_) to separate components:
[domain]_[topic]_[detail]
domain: thematic area (e.g.,demog,inter,labs)topic: clinical concept (e.g.,sex,suppleo2,glucose)detail: optional element used to indicate format, type, or follow-up
Example:¶
demog_occupation_oth: Open text field specifying occupation if “Other” was selectedinter_nasalprongs_dur: Duration (in days/hours) of nasal prong oxygen therapy
2. HierARCy Depth¶
Variable names contain varying numbers of underscores to reflect their specificity:
Number of Underscores |
Meaning |
|---|---|
1 ( |
General field (e.g., |
2 ( |
Component of a broader concept |
3+ |
Further detail or sub-field (less common) |
3. Common Suffixes¶
Below are common suffixes that reflect variable function or logic. All examples are real variables from the dataset:
Suffix |
Description |
Examples |
|---|---|---|
|
Text field for specifying “Other” |
|
|
Specifies type or category of a condition |
|
|
Date field |
|
|
Specification detail for a selected category |
|
|
Duration of an intervention or event |
|
|
Indicates whether a treatment/condition is ongoing |
|
|
Numerical count |
|
|
Site or anatomical location |
|
|
Lab measurement in mg/dL |
|
|
Percent value |
|
|
Yes/No binary question |
|
Numbered Suffixes for Repeated Instances¶
Some suffixes include numbers (e.g., _date1, _type2) to capture repeated events or values, especially useful for vaccines, medications, and other longitudinal data.
Suffix |
Description |
Example Variables |
|---|---|---|
|
First date of event |
|
|
Second instance (e.g., second dose) |
|
|
Third instance |
|
|
First categorical instance |
|
|
Second instance |
|
|
Third instance |
|
|
Specification field for first entry |
|
These suffixes are used in parallel and are crucial for distinguishing multiple entries of the same concept within a participant record.
4. Section Mapping¶
Each variable is linked to a logical section (e.g., INCLUSION CRITERIA, DEMOGRAPHICS, INTERVENTIONS) that determines its role in the clinical case report form (CRF). This section assignment ensures that CRFs are structured clearly and reproducibly.
5. Branching Logic Patterns¶
Suffixes like _oth, _spec, _ongoing, _dur, and _site often imply conditional visibility, depending on the answer to a parent field:
expo14_yn = Yes→ showexpo14_typeoth_specinter_suppleo2_ongoing = Yes→ may showinter_suppleo2_durdemog_gender = Other→ showdemog_gender_oth
These dependencies must be respected in CRF tools like BRIDGE to ensure logical flow and data consistency.
6. Variable Types¶
ARC uses various variable types to capture structured, semi-structured, and free-text data. Each type implies specific data constraints and rendering logic:
Type |
Description |
|---|---|
|
Free-text entry, typically short answers |
|
A list of options drawn from an external source, modifiable by the user |
|
Single choice from a set of predefined options |
|
Multiple selections allowed from a list |
|
Similar to |
|
Like |
|
Date field (day-month-year format) |
|
Date and time field in DMY format |
|
Numeric input, often with |
|
Calculated field (derived from other variables) |
|
Field for file upload |
|
Long free-text, typically for comments or additional detail |
|
Static text or headers, not a question or input field |
7. Unit-Selectable Variables¶
Some numeric variables are associated with “Select Units” fields. These allow users to specify the measurement unit alongside the value.
Common Unit Systems:¶
Concept |
Example Units |
|---|---|
Height |
cm, inches |
Weight |
kg, pounds |
Temperature |
°C, °F |
Blood pressure |
mmHg |
Structure:¶
These are typically implemented as paired fields, for example:
demog_weight→ select unit (radio/dropdown)demog_weight_kg,demog_weight_lb→ numeric input fields (conditional on unit)
Such fields allow flexibility across settings while preserving data harmonization. The recommended unit (standardized) is often pre-selected via presets in BRIDGE.