Physiological recordings

Example datasets

Example datasets with physiological data have been formatted using this specification and can be used for practical guidance when curating a new dataset:

General specifications

Continuous (that is, regularly sampled over time at a fixed frequency) physiological recordings such as cardiac and respiratory signals, and asynchronous events corresponding to those signals MAY be specified using compressed tabular files (TSV.GZ file). TSV.GZ files MUST be accompanied by a JSON file with the same name as their corresponding tabular file but with a .json extension.

Template:

sub-<label>/
    [ses-<label>/]
        anat/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
            <matches>[_recording-<label>]_physioevents.json
            <matches>[_recording-<label>]_physioevents.tsv.gz
        beh/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
            <matches>[_recording-<label>]_physioevents.json
            <matches>[_recording-<label>]_physioevents.tsv.gz
        dwi/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
            <matches>[_recording-<label>]_physioevents.json
            <matches>[_recording-<label>]_physioevents.tsv.gz
        eeg/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
            <matches>[_recording-<label>]_physioevents.json
            <matches>[_recording-<label>]_physioevents.tsv.gz
        emg/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
            <matches>[_recording-<label>]_physioevents.json
            <matches>[_recording-<label>]_physioevents.tsv.gz
        func/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
            <matches>[_recording-<label>]_physioevents.json
            <matches>[_recording-<label>]_physioevents.tsv.gz
        ieeg/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
            <matches>[_recording-<label>]_physioevents.json
            <matches>[_recording-<label>]_physioevents.tsv.gz
        meg/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
            <matches>[_recording-<label>]_physioevents.json
            <matches>[_recording-<label>]_physioevents.tsv.gz
        motion/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
            <matches>[_recording-<label>]_physioevents.json
            <matches>[_recording-<label>]_physioevents.tsv.gz
        nirs/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
            <matches>[_recording-<label>]_physioevents.json
            <matches>[_recording-<label>]_physioevents.tsv.gz
        perf/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
            <matches>[_recording-<label>]_physioevents.json
            <matches>[_recording-<label>]_physioevents.tsv.gz
        pet/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
            <matches>[_recording-<label>]_physioevents.json
            <matches>[_recording-<label>]_physioevents.tsv.gz

Legend:

For more information about filename elements (for example, entities, suffixes, extensions), follow the links embedded in the filename template.
<matches> is a placeholder to denote an arbitrary (and valid) sequence of entities and labels at the beginning of the filename (only BIDS "raw").
<source-entities> is a placeholder to denote an arbitrary sequence of entities and labels at the beginning of the filename matching a source file from which the file derives (only BIDS-Derivatives).
Filename entities or directories between square brackets (for example, [_ses-<label>]) are OPTIONAL.
Some entities may only allow specific values, in which case those values are listed in <>, separated by |.
_<suffix> means that there are several (>6) valid suffixes for this filename pattern.
.<extension> means that there are several (>6) valid extensions for this file type.
[.gz] means that both the unzipped and gzipped versions of the extension are valid.

The recording-<label> entity is OPTIONAL, and is described in Continuous physiological recordings, below.

Caution

Columns of TSV.GZ files MUST be defined in the corresponding JSON sidecar and the tabular content MUST NOT include a header line.

As a consequence, when supplying a <matches>_<physio|physioevents>.tsv.gz file, an accompanying <matches>_<physio|physioevents>.json MUST be supplied as well.

For multi-echo data, a single _physio.<tsv.gz|json> file without the echo-<index> entity applies to all echos of a particular run. For example:

└─ sub-01/
   └─ func/
      ├─ sub-01_task-nback_run-1_echo-1_bold.nii.gz 
      ├─ sub-01_task-nback_run-1_echo-2_bold.nii.gz 
      ├─ sub-01_task-nback_run-1_echo-3_bold.nii.gz 
      └─ sub-01_task-nback_run-1_physio.tsv.gz

This specification section first describes the organization of continuous physiological recordings, and then events corresponding to the physiological recordings. Finally, the remainder of the document describes specific types of continuous recordings such as eye-tracking.

Continuous physiological recordings

Continuous physiological recordings, such as pulse monitoring, electrocardiogram, respiratory movement measured with a respiration belt, gas concentration, or eye-tracking, MUST use _physio.<tsv.gz|json> pairs.

Storing different recordings

Recorded physio data MUST be split into separate data files in case of difference in top-level metadata like SamplingFrequency, Software, and Manufacturer of the main recording device (i.e., data source). These top-level metadata are discussed in the following section.

Data with common top-level metadata MAY be kept aggregated in one file otherwise, or split based on channel type, if preferred. The sole exception is eye tracking data, that MUST be split in its own file, following its specification.

We RECOMMEND keeping different files from different recording devices separate, but for easier inspection and analysis they can kept together to get a clearer picture of what the fluctuations describe (e.g., looking at ventilation and respiration together, or PPG and ECG for motion artifacts).

We RECOMMEND to store trigger signals recorded alongside physiological channels in the same file when concurrent modalities are collected (e.g. functional MRI or EEG).

The recording-<label> entity MAY be used to distinguish between several recording files. Recordings with different metadata such as sampling frequencies or recording device MUST be stored in separate files with different recording-<label> entities.

It is possible that the recording-<label> entity uses terms that could be confused with metadata field values, such as MeasurementType or SamplingFrequency. In that case, the lowest metadata level available should always be interpreted as the most reliable information. For instance, if the file name contains recording-1000hz but the SamplingFrequency metadata indicates a sampling frequency of 100Hz, data MUST be interpreted as being sampled at 100 Hz. Similarly, if the entity recording-ecg is used, but the MeasurementType metadata of the contained columns indicate “ppg” and “Ventilation”, the data MUST be interpreted as PPG and Ventilation, and not ECG.

For example:

Splitting recorded data into separate physio data files

└─ sub-001/
   └─ ses-01/
      └─ physio/
         ├─ sub-001_ses-01_recording-scr_physio.json 
         ├─ sub-001_ses-01_recording-scr_physio.tsv.gz 
         ├─ sub-001_ses-01_recording-ecg_physio.json 
         ├─ sub-001_ses-01_recording-ecg_physio.tsv.gz 
         ├─ sub-001_ses-01_recording-resp_physio.json 
         └─ sub-001_ses-01_recording-resp_physio.tsv.gz

Combining recorded data into one pair of physio data files

└─ sub-001/
   └─ ses-01/
      └─ physio/
         ├─ sub-001_ses-01_physio.json 
         └─ sub-001_ses-01_physio.tsv.gz

└─ dataset/
   └─ sub-<label>/
      └─ ses-<label>/
         └─ physio/
            ├─ sub-001_ses-01_physio.json 
            └─ sub-001_ses-01_physio.tsv.gz

For example, given a BOLD acquisition of a breath-holding task (task-bht) for which pulse and respiratory movement were sampled at different frequencies, recordings are separated as follows:

└─ sub-01/
   └─ func/
      ├─ sub-01_task-bht_bold.nii.gz 
      ├─ sub-01_task-bht_recording-cardiac_physio.json 
      ├─ sub-01_task-bht_recording-cardiac_physio.tsv.gz 
      ├─ sub-01_task-bht_recording-respiratory_physio.json 
      └─ sub-01_task-bht_recording-respiratory_physio.tsv.gz

Metadata fields for <matches>_physio.json files. General metadata fields include SamplingFrequency, StartTime, Columns, and Manufacturer, in addition to individual column descriptions. Each individual column in the TSV file MAY be documented as its own field in the JSON file (identical to the practice in other TSV+JSON file pairs).

Caution

Recordings with different key metadata MUST be split into separate files.

When key metadata such as sampling frequencies, manufacturers varies between recordings, tabular data MUST be split into separate files. In such cases, the recording-<label> entity MUST be used to distinguish these files.

Metadata sidecar files (<matches>_physio.json) MAY define the field PhysioType to indicate a specific type of recording. The default value of PhysioType is "generic", and MUST be assumed if the PhysioType metadata is not defined. Specific recording types (that is, when PhysioType takes a valid value other than "generic") have separate prescriptions for columns in the TSV.GZ files and corresponding metadata specifications.

RECOMMENDED

Using specific recording types is RECOMMENDED when available in the specification.

The allowed physiological recording types encoded by PhysioType and their corresponding metadata specifications are described in subsection Specific physiological signal types below, and its subsections:

eye-tracking (subsection Eye-tracking).

The following table specifies metadata fields for "generic" recordings:

Key name	Requirement Level	Data type	Description
SamplingFrequency	REQUIRED	number	Sampling frequency (in Hz) of all the data in the recording, regardless of their type (for example, `2400`).
StartTime	REQUIRED	number	Start time in seconds in relation to the start of acquisition of the first data sample in a reference file sharing the same "ConcurrenceGroup" identifier (negative values are allowed). Within each set of “ConcurrenceGroup” files, a reference file MUST be designated with a “StartTime” equaling 0. This data MAY be specified with sub-second precision using the syntax `s[.000000]`, where `s` reflects whole seconds, and `.000000` reflects OPTIONAL fractional seconds. If no ConcurrenceGroup identifier is defined, the StartTime should be set to 0.
Columns	REQUIRED	array of strings	Names of columns in file.
PhysioType	RECOMMENDED	string	Defines the specific type of physiological recording. For backwards compatibility, the default value is `"generic"`. Must be one of: `"generic"`, `"eyetrack"`, `"enriched"`.

Hardware information. Details about the hardware MAY be stored with the following metadata fields.

Key name	Requirement Level	Data type	Description
DeviceSerialNumber	OPTIONAL	string	The serial number of the equipment that recorded the measurements: that is, the main recording device. A pseudonym can also be used to prevent the equipment from being identifiable, so long as each pseudonym is unique within the dataset.
Manufacturer	OPTIONAL	string	Manufacturer of the main equipment that recorded the measurements.
ManufacturersModelName	OPTIONAL	string	Manufacturer's model name of the main equipment that recorded the measurements.
SoftwareVersions	OPTIONAL	string	Manufacturer's designation of software version of the equipment that produced the measurements.

Additional metadata may be included as in any TSV file to specify, for example, the units of the recorded time series.

Column naming recommendations for "generic" recordings. To store pulse or breathing measurements, or the scanner trigger signal, the following naming conventions SHOULD be used for the column names:

Column name	Requirement Level	Data type	Description
cardiac	OPTIONAL	number	continuous pulse measurement
respiratory	OPTIONAL	number	continuous breathing measurement
trigger	OPTIONAL	number	continuous measurement of the scanner trigger signal
Additional Columns	OPTIONAL	`n/a`	Additional columns are allowed.

Please note that the specification of columns such as cardiac, respiratory, and trigger above follow the general specifications for tabular files:

Key name	Requirement Level	Data type	Description
LongName	OPTIONAL	string	Long (unabbreviated) name of the column.
Description	RECOMMENDED	string	Free-form natural language description. The description of the column.
Levels	RECOMMENDED	object	For categorical variables: An object of possible values (keys) and their descriptions (values).
Units	RECOMMENDED	string	Measurement units for the associated variable. SI units in CMIXF formatting are RECOMMENDED (see Units).
Delimiter	OPTIONAL	string	If rows in a column may be interpreted as a lists of values, the character that separates one value from the next.
TermURL	RECOMMENDED	string	URL pointing to a formal definition of this type of data in an ontology available on the web. For example: https://www.ncbi.nlm.nih.gov/mesh/68008297 for "male".
HED	OPTIONAL	string or object of strings	Hierarchical Event Descriptor (HED) information, see the HED Appendix for details.

Examples. Let's encode cardiac and respiratory recordings, as well as a trigger signal, the three of them sampled at 100.0 Hz by the same device during a behavioral task:

└─ sub-01/
   └─ func/
      ├─ sub-01_task-nback_physio.json 
      └─ sub-01_task-nback_physio.tsv.gz

In this example, the contents of sub-01_task-nback_physio.tsv.gz after decompression are:

0	34	110	0
1	44	112	0
2	23	100	1

and the header-less TSV.GZ contents are described with the following metadata sub-01_task-nback_physio.json where the Columns field defines the names corresponding to the three columns above:

{
    "Columns": ["cardiac", "respiratory", "trigger"],
    "Manufacturer": "Brain Research Equipment ltd.",
    "PhysioType": "generic",
    "SamplingFrequency": 100.0,
    "StartTime": -22.345,
    "cardiac": {
      "Description": "continuous pulse measurement",
      "Units": "mV"
    },
    "respiratory": {
      "Description": "continuous measurements by respiration belt",
      "Units": "mV"
    },
    "trigger": {
      "Description": "continuous measurement of the scanner trigger signal",
      "Units": "V"
    }
}

The example shows the three REQUIRED metadata entries Columns, SamplingFrequency, and StartTime. Columns are further described following the specifications for tabular files, indicating Description and Units fields. Other fields, such as TermURL, LongName, MAY be included.

Because a missing PhysioType is assumed to be "generic", the following sidecar is equivalent:

{
    "Columns": ["cardiac", "respiratory", "trigger"],
    "Manufacturer": "Brain Research Equipment ltd.",
    "SamplingFrequency": 100.0,
    "StartTime": -22.345,
    "cardiac": {
      "Description": "continuous pulse measurement",
      "Units": "mV"
    },
    "respiratory": {
      "Description": "continuous measurements by respiration belt",
      "Units": "mV"
    },
    "trigger": {
      "Description": "continuous measurement of the scanner trigger signal",
      "Units": "V"
    }
}

If the "cardiac" and "respiratory" signals above were acquired at different sampling frequencies, then the recordings MUST be separated into two files disambiguated by the recording-<label> entity:

└─ sub-01/
   └─ func/
      ├─ sub-01_task-nback_recording-cardiac_physio.json 
      ├─ sub-01_task-nback_recording-cardiac_physio.tsv.gz 
      ├─ sub-01_task-nback_recording-respiratory_physio.json 
      └─ sub-01_task-nback_recording-respiratory_physio.tsv.gz

Physiology "events"

Discontinuous data associated with continuous recordings stored in <matches>_physio.tsv.gz files MAY be specified following the summary template pattern above.

Physiology "events" files <matches>_recording-<label>_physioevents.tsv.gz MUST follow specific column specifications:

Column name	Requirement Level	Data type	Description
onset	REQUIRED	number	Onset of the event relative to the timeline defined by the corresponding physiological recording file. Typically, it will correspond to a timestamp issued by the recording device. This column must appear first in the file.
duration	RECOMMENDED	number	Duration of the event (measured from onset) in seconds. Must always be either zero or positive (or `n/a` if unavailable). A "duration" value of zero implies that the delta function or event is so short as to be effectively modeled as an impulse. This column may appear anywhere in the file. Must be a number greater than or equal to 0.
trial_type	OPTIONAL	string	Primary categorisation of each trial to identify them as instances of the experimental conditions. For example: for a response inhibition task, it could take on values `go` and `no-go` to refer to response initiation and response inhibition experimental conditions. This column may appear anywhere in the file.
message	OPTIONAL	string	Brief free-text description of a message (for example in a log generated by a device), or other information of interest. This column may appear anywhere in the file.
Additional Columns	OPTIONAL	`n/a`	Additional columns are allowed.

The following table specifies metadata fields for the <matches>[_recording-<label>]_physioevents.json file.

Key name	Requirement Level	Data type	Description
Columns	REQUIRED	array of strings	Names of columns in file.
Description	RECOMMENDED	string	Free-form natural language description.
OnsetSource	REQUIRED	string	An existing column name in an corresponding recoding data TSV file that indexes or is correspondent to the `"onset"` column of the present TSV file. For example, `"timestamp"` column.

The REQUIRED OnsetSource metadata specifies the interpretation of the values of the onset column. If OnsetSource is the name of a column in the associated <matches>[_recording-<label>]_physio.tsv.gz file, the values have the same interpretation as values in the named column.

For example, considering the following structure:

└─ sub-01/
   └─ func/
      ├─ sub-01_task-nback_physio.json 
      ├─ sub-01_task-nback_physio.tsv.gz 
      ├─ sub-01_task-nback_physioevents.json 
      └─ sub-01_task-nback_physioevents.tsv.gz

The decompressed contents of sub-01_task-nback_physio.tsv.gz are:

13894432329	10.1
13894432330	10.0
13894432331     9.5
13894432332     9.2
13894432333     9.0
13894432334	10.2
13894432335	10.3
13894432336	10.1

And sub-01_task-nback_physio.json defines timestamp column:

{
    "SamplingFrequency": 100.0,
    "StartTime": -22.345,
    "Columns": ["timestamp", "cardiac"],
    "cardiac": {
        "Description": "continuous pulse measurement",
        "Units": "mV"
    },
    "timestamp": {
        "Description": "a continuously increasing identifier of the sampling time registered by the device",
        "Units": "ms",
        "Origin": "System startup"
    }
}

The decompressed contents of the corresponding sub-01_task-nback_physioevents.tsv.gz are:

13894432325	Ready
13894432331	Synchronous recalibration triggered
13894432334	External message received: new block

To indicate that the first column (onset) is to be interpreted as a timestamp, the OnsetSource MUST be set to "timestamp" in sub-01_task-nback_physioevents.json:

{
    "Columns": ["onset", "message"],
    "Description": "Messages logged by the measurement device",
    "OnsetSource": "timestamp"
}

If there is no appropriate source column in <matches>[_recording-<label>]_physio.tsv.gz, OnsetSource MAY be set to "n/a". In this case, the values of onset MUST be interpreted as row indices into the physio file, with the first row having index zero (0). Negative onsets are possible, and such events are interpreted as occurring prior to the start of the recording, at the same sampling rate.

For example, the above sub-01_task-nback_physioevents.tsv.gz could be equivalently written:

-3	Ready
3	Synchronous recalibration triggered
6	External message received: new block

To indicate that the first column (onset) is to be interpreted as an index, the OnsetSource is set to "n/a" in sub-01_task-nback_physioevents.json:

{
    "Columns": ["onset", "message"],
    "Description": "Messages logged by the measurement device",
    "OnsetSource": "n/a"
}

Specific physiological signal types

Enriched physiological metadata

JSON Data files. All metadata we are proposing are either OPTIONAL or RECOMMENDED, and they are meant to enrich the current "generic" PhysioType. However, we are also suggesting the introduction of a "enriched" PhysioType, that will differ from "generic" because one proposed metadata, MeasureType, will be REQUIRED rather than RECOMMENDED. Equally, the Units metadata will be REQUIRED instead of RECOMMENDED in this case.

Compared to the current BIDS specification (1.10.0), at the file level we are adding one metadata, the OPTIONAL SubjectPosition, indicating the position of the subject during the data collection (see below "Metadata fields used in top level metadata").

When specifying column names, columns MUST have unique names. All such data columns MUST be appropriately defined in the JSON metadata.

Example:

{
  "Columns": ["screda1", "screda2", "ecg", "ppg"],
  "SamplingFrequency": 1000,
  "SubjectPosition": "sitting",
  "PhysioType": "enriched",
  "screda1": {
    "MeasureType": "EDA-phasic",
    "Units": "mS",
    "Placement": "Thenar",
  },
  "screda2": {
    "MeasureType": "EDA-tonic",
    "Units": "mS",
    "Placement": "Hypothenar",
  },
  "ecg": {
    "MeasureType": "ECG",
    "Units": "mV",
    "Placement": "II",
  },
  "ppg": {
    "MeasureType": "PPG",
    "Units": "au",
    "Placement": "Right earlobe",
  }
}

As described in the table below ("Metadata fields for column description."), this BEP is adding a few metadata to describe columns.

The most important one is MeasureType, a RECOMMENDED metadata that indicates the actual nature of the data in the column.
- This metadata value is a string that MUST come from a set of keywords.
- This set of keywords can be expanded in the future to include more physiological modalities.
- When the file-level metadata PhysioType is "enriched", MeasureType becomes a REQUIRED field for each column.

This metadata is meant to be the most reliable indicator of the type of data contained in the described column. Having a reliable and standardized indication of what type of data is being handled allows automated modality specific data processing and prevents data misuse.

Furthermore, we are proposing that Units becomes a REQUIRED metadata when PhysioType is "enriched". Not only this helps to better reflect the possible quantitative nature of physiological data, but since similarly labelled data (e.g. Ventilation) can be expressed in different units, indicating different underlying processes, sensors, or levels of real-time preprocessing and data manipulation (e.g. transformation from Volts to millimeters of Mercury), making this field more explicit in the section regarding physiological data will help improve data interpretation. Specification of units SHOULD follow the International System of Units (see BIDS specification).

We are also introducing a Placement RECOMMENDED metadata, that describes the position of the sensor during data collection. For instance, a file could have three columns of ventilation data, one collected at the navel, one at the diaphragm, and one at the armpit level, in which case Placement values would be “Navel”, “Diaphragm”, and “Armpit” respectively. In case the data describes gas concentration, such as CO2 or O2, Placement SHOULD be used to indicate if a “Nose” cannula versus a “Mouth” mouthpiece or a “Mask” was used.

The three metadata at this level describing hardware are:

ChannelManufacturersModelName (RECOMMENDED)
ChannelManufacturers (RECOMMENDED)
ChannelDeviceSerialNumber (OPTIONAL)

These metadata are meant to describe the nature of the equipment used to record data. Different components from different manufacturers could be used at the same time in a “patchwork” approach in which a sensor or amplifier from manufacturer A is connected to the recording device of manufacturer B, and even the same manufacturer could provide two or more options to measure the same type of data. Many setups that differ in this way introduce a potential difference in data processing (e.g. digital vs analogical lags, delays and sharpness of the recording, quantification, …).

Thus, we RECOMMEND to increase the granularity of the setup description for each column, and we RECOMMEND to report names and manufacturers (when different from the main unit) of sensors, connective elements (e.g. cannulae or cables), and amplifiers. Serial numbers MAY be reported as well.

In this framework, it is crucial to distinguish between the different fields available for specifying recording equipment in the meta-data: at the top-level, the main recording device and software are characterized in meta-data fields such as SoftwareModels and DeviceSerialNumber, while at the column-level, information about channel-specific hardware is characterized in meta-data fields such as ChannelDeviceSerialNumber.

We provide the example shown above to assist in determining the main recording device in common physiological acquisition set-ups. In the example shown above, three different recording systems are being used to concurrently acquire physiological data. The first system acquires two channels of physiological data with software A and main recording device ‘a’, which both would be specified using the top-level fields in the accompanying meta-data. Upstream, hardware such as amplifiers, filters, cables, and sensors would be specified using column-level fields specific to each channel in the accompanying meta-data. In the second system, one channel of physiological data is being acquired by main recording device ‘b’ and wirelessly transmitted to software B. In this case, the sensor attached to device ‘b’ can still be specified using column-level meta-data fields if it is an independent product. In the third system, data is acquired by a physiological monitoring unit which is integrated with an MRI scanner (device ‘c’), which itself acts as the main recording device. In case of using networked middleware systems such as the lab streaming layer, where the data may be centrally recorded, the central recording computer itself MAY be considered the main recording device.

Finally, the AmplifierSettings is a dictionary meant to be filled with potential amplifier settings that can manipulate the data collection at the source, e.g. low-pass filters or DC/AC currents. Because each amplifier and each manufacturer have different settings, we cannot define further the content of this dictionary, but we suggest using manufacturer specific pairs of keys and values. In this dictionary, we also SUGGEST reporting eventual data transformations (e.g. the exact formula used to transform gas pressure from measured Voltage to millimetres of Mercury).

More information about the metadata entities contained in the JSON files can be found in the tables below.

Metadata fields used in top level metadata.

Key name	Requirement Level	Data type	Description
SamplingFrequency	REQUIRED	number	Sampling frequency (in Hz) of all the data in the recording, regardless of their type (for example, `2400`).
StartTime	REQUIRED	number	Start time in seconds in relation to the start of acquisition of the first data sample in a reference file sharing the same "ConcurrenceGroup" identifier (negative values are allowed). Within each set of “ConcurrenceGroup” files, a reference file MUST be designated with a “StartTime” equaling 0. This data MAY be specified with sub-second precision using the syntax `s[.000000]`, where `s` reflects whole seconds, and `.000000` reflects OPTIONAL fractional seconds. If no ConcurrenceGroup identifier is defined, the StartTime should be set to 0.
Columns	REQUIRED	array of strings	Names of columns in file.

Metadata fields for column description.

Column name	Requirement Level	Data type	Description
cardiac	OPTIONAL	number	continuous pulse measurement
respiratory	OPTIONAL	number	continuous breathing measurement
trigger	OPTIONAL	number	continuous measurement of the scanner trigger signal
Additional Columns	OPTIONAL	`n/a`	Additional columns are allowed.

MeasureType descriptions.

MeasureType	Name	Description
Trigger	Trigger	Digital (binary TTL) or analog (TTL in Volt) values indicating scanner triggers.
PPG	Photoplethysmography	Continuous optical signal capturing the cardiac pulsation.
ECG	Electrocardiography	Continuous electrical signal capturing the cardiac activity.
Ventilation	Ventilation	Continuous breathing measurement.
CO2	Carbon dioxide	Continuous measurement of the carbon dioxide concentration in expired air.
O2	Oxygen	Continuous measurement of the oxygen concentration from respiratory gases.
PetCO2	End-tidal carbon dioxide	Continuous measurement of the end-tidal pressure of carbon dioxide at the end of an exhalation.
PetO2	End-tidal oxygen	Continuous measurement of the end-tidal pressure of oxygen at the end of an inhalation.
EDA-tonic	Electrodermal activity, tonic component	Continuous measurement of low-frequency changes in electrodermal activity, also known as skin conductance level.
EDA-phasic	Electrodermal activity, phasic component	Continuous measurement of high-frequency changes in electrodermal activity, also known as skin conductance response.
EDA-total	Electrodermal activity	Continuous measurement of the changes in electrical properties of the skin.
BP	Blood pressure	Continuous measurement of the blood pressure waveform representing the changes in arterial pressure over time.
Other	Other	Any other type of channel.

Eye-tracking

Example datasets

Example datasets with eye-tracking data have been formatted using this specification and can be used for practical guidance when curating a new dataset:

Combined fMRI and eye-tracking data in a resting-state task, measured with an Eyelink (SR research). Human participant kept their gaze steady at the screen center.

BIDS dataset
Combined behavioral and eye-tracking data, measured with and Eyelink (SR Research). Human participants freely viewed as set of natural images. Published paper: https://doi.org/10.1523/ENEURO.0287-23.2023

BIDS dataset

Setting PhysioType to the keyword "eyetrack" specifies that the physiological recordings in the <matches>_physio.tsv.gz have been acquired with an eye-tracker. In the following, eye-tracker refers to the apparatus allowing the recording of gaze position, and, optionally, pupil size.

Eye-tracking data MUST be stored following the general specifications for "generic" physiological recordings. However, it is REQUIRED that recordings corresponding to each eye (and/or cyclopean or averaged signals for binocular eye-trackers providing a third recording) are split into files with different recording-<label>. Therefore, the use of recording-<label> is REQUIRED with eye-tracking data. The values "eye1", "eye2", and "eye3" are RECOMMENDED as the respective labels for the recording-<label> entity.

MANDATORY metadata

The correspondence of labels and the recorded eye MUST be encoded by the MANDATORY RecordedEye metadata.

The recording-<label> entity MAY take other values such as "left", "cyclopean", or "right" corresponding to the RecordedEye metadata. However, it is RECOMMENDED that metadata is not encoded in the file names to avoid conflicts between filenames and metadata. For example, if recording-<label> takes the value "left" but the corresponding sidecar JSON file contains a definition of RecordedEye being "right".

Eye-tracking files <matches>_recording-<label>_physio.tsv.gz MUST follow specific column prescriptions:

Column name	Requirement Level	Data type	Description
timestamp	REQUIRED	number	Timestamp issued by the eye-tracker indexing the continuous recordings corresponding to the sampled eye. This column must appear first in the file.
x_coordinate	REQUIRED	number	Gaze position x-coordinate of the recorded eye, in the coordinate units specified in the corresponding metadata sidecar. This column must appear second in the file.
y_coordinate	REQUIRED	number	Gaze position y-coordinate of the recorded eye, in the coordinate units specified in the corresponding metadata sidecar. This column must appear third in the file.
pupil_size	OPTIONAL	number	Pupil size, as area or diameter, of the recorded eye, in the units specified in the corresponding metadata sidecar. It is RECOMMENDED to indicate the particular type of pupil size that is being recorded in the `Description` field of this column. It is RECOMMENDED to specify the `Units` field corresponding to this column. This column may appear anywhere in the file.
Additional Columns	OPTIONAL	`n/a`	Additional columns are allowed.

Please note that the specification of columns such as timestamp, x_coordinate, y_coordinate, and pupil_size follow the general specifications for tabular files and MAY define metadata fields such as LongName, Description, Levels, or TermURL. However, in the case of eye-tracking, the metadata entry Units, becomes REQUIRED for the columns x_coordinate and y_coordinate:

Key name	Requirement Level	Data type	Description
Units	REQUIRED	string	Measurement units for the associated variable. SI units in CMIXF formatting are RECOMMENDED (see Units).

The following table specifies metadata fields for the <matches>_recording-<label>_physio.json file:

Key name	Requirement Level	Data type	Description
SamplingFrequency	REQUIRED	number	Sampling frequency (in Hz) of all the data in the recording, regardless of their type (for example, `2400`).
StartTime	REQUIRED	number	Start time in seconds in relation to the start of acquisition of the first data sample in a reference file sharing the same "ConcurrenceGroup" identifier (negative values are allowed). Within each set of “ConcurrenceGroup” files, a reference file MUST be designated with a “StartTime” equaling 0. This data MAY be specified with sub-second precision using the syntax `s[.000000]`, where `s` reflects whole seconds, and `.000000` reflects OPTIONAL fractional seconds. If no ConcurrenceGroup identifier is defined, the StartTime should be set to 0.
Columns	REQUIRED	array of strings	Names of columns in file.
DeviceSerialNumber	OPTIONAL	string	The serial number of the equipment that recorded the measurements: that is, the main recording device. A pseudonym can also be used to prevent the equipment from being identifiable, so long as each pseudonym is unique within the dataset.
Manufacturer	OPTIONAL	string	Manufacturer of the main equipment that recorded the measurements.
ManufacturersModelName	OPTIONAL	string	Manufacturer's model name of the main equipment that recorded the measurements.
SoftwareVersions	OPTIONAL	string	Manufacturer's designation of software version of the equipment that produced the measurements.
PhysioType	REQUIRED	string	Defines the specific type of physiological recording. For backwards compatibility, the default value is `"generic"`. Must be one of: `"generic"`, `"eyetrack"`, `"enriched"`.
RecordedEye	REQUIRED	string	Indicates the eye being tracked, for example, `"left"` or `"right"`. It SHOULD be set to `"cyclopean"` for recordings combining both eyes as potentially generated by binocular eye-trackers. Must be one of: `"left"`, `"right"`, `"cyclopean"`.
SampleCoordinateSystem	REQUIRED	string	Coordinate system of the gaze position recordings. Generally eye-trackers are set to use `"gaze-on-screen"` coordinate system, but you may use `"eye-in-head"` or `"gaze-in-world"`. If none of the three default options properly describe the coordinate system, the `"custom"` keyword MUST be employed and the coordinate system MUST be described using additional metadata entries. Must be one of: `"gaze-on-screen"`, `"eye-in-head"`, `"gaze-in-world"`, `"custom"`.
AverageCalibrationError	OPTIONAL	number	Average calibration error in degrees of visual angle.
CalibrationCount	OPTIONAL	integer	The number of calibrations corresponding to this run. Must be a number greater than or equal to 0.
CalibrationPosition	OPTIONAL	array of arrays	A list of `[x, y]` coordinates in the `CalibrationUnit`. For example, using 5 positions calibration presented on an HD screen, it could be `[[960,50],[960,540],[960,1030],[50,540],[1870,540]]`.
CalibrationType	OPTIONAL	string	The type of the calibration procedure executed last. For example the `"H3"` for horizontal calibration with 3 positions or `"HV9"` for horizontal and vertical calibration with 9 positions, or one point calibration.
CalibrationUnit	OPTIONAL	string	Unit of `"CalibrationPosition"`. Must be one of: `"pixel"`, `"mm"`, `"cm"`.
EyeTrackerDistance	OPTIONAL	number or array of numbers	Distance (in meters) between the eye-tracker and the participant eye(s). Distance can either be expressed as a single numeric value indicating the shortest distance between the participant's eye and the eye-tracker's camera, or a three-coordinates array indicating the X, Y, Z distances between the participant's eye and the eye-tracker's camera.
EyeTrackingMethod	OPTIONAL	string	Method used to track gaze or pupil position: "P–CR" for video based eye-tracking using pupil and corneal reflection; "DPI" for Dual-Purkinje Imaging system, "SSC" for scleral search coils; "EOG" for electro-oculogram; "Limbus" for trackers estimating limbus borders between the iris and sclera; or other.
MaximalCalibrationError	OPTIONAL	number	Maximal calibration error in degrees of visual angle.
PupilFitMethod	OPTIONAL	string	The method employed for fitting the pupil, for example `"centre-of-mass"` or `"ellipse"`. If `"centre-of-mass"` or `"ellipse"` method is used, it is RECOMMENDED to use these exact labels.
RawDataFilters	OPTIONAL	string	Filter settings applied to eye-movement raw data.

Comprehensively documenting the calibration metadata is RECOMMENDED.

Eye-tracking files <matches>_recording-<label>_physio.tsv.gz MAY be annotated with a corresponding <matches>_recording-<label>_physioevents.tsv.gz file. The <matches>_recording-<label>_physioevents.tsv.gz file MAY be employed to record discontinuous model parameters generated by the eye-tracker, for example, those derived from the saccade and blinks model some eye-trackers produce.

Important

For eye-tracking recordings where "SampleCoordinateSystem" is set to "gaze-on-screen", the following fields pertaining to <matches>_events.json escalate to REQUIRED as they are considered essential in eye-tracking data analysis:

StimulusPresentation.ScreenDistance,
StimulusPresentation.ScreenOrigin,
StimulusPresentation.ScreenResolution,
StimulusPresentation.ScreenSize.

Examples. The recordings produced by a monocular eye-tracker during a visual search task may display the following structure:

└─ sub-01/
   └─ func/
      ├─ sub-01_task-visualSearch_bold.json 
      ├─ sub-01_task-visualSearch_bold.nii.gz 
      ├─ sub-01_task-visualSearch_events.json 
      ├─ sub-01_task-visualSearch_events.tsv 
      ├─ sub-01_task-visualSearch_recording-eye1_physio.json 
      ├─ sub-01_task-visualSearch_recording-eye1_physio.tsv.gz 
      ├─ sub-01_task-visualSearch_recording-eye1_physioevents.json 
      └─ sub-01_task-visualSearch_recording-eye1_physioevents.tsv.gz

The above example is extended to a binocular eye-tracker producing three signals (left and right eyes, plus a cyclopean recording), as follows:

└─ sub-01/
   └─ func/
      ├─ sub-01_task-visualSearch_bold.json 
      ├─ sub-01_task-visualSearch_bold.nii.gz 
      ├─ sub-01_task-visualSearch_events.json 
      ├─ sub-01_task-visualSearch_events.tsv 
      ├─ sub-01_task-visualSearch_recording-eye1_physio.json 
      ├─ sub-01_task-visualSearch_recording-eye1_physio.tsv.gz 
      ├─ sub-01_task-visualSearch_recording-eye1_physioevents.json 
      ├─ sub-01_task-visualSearch_recording-eye1_physioevents.tsv.gz 
      ├─ sub-01_task-visualSearch_recording-eye2_physio.json 
      ├─ sub-01_task-visualSearch_recording-eye2_physio.tsv.gz 
      ├─ sub-01_task-visualSearch_recording-eye2_physioevents.json 
      ├─ sub-01_task-visualSearch_recording-eye2_physioevents.tsv.gz 
      ├─ sub-01_task-visualSearch_recording-eye3_physio.json 
      ├─ sub-01_task-visualSearch_recording-eye3_physio.tsv.gz 
      ├─ sub-01_task-visualSearch_recording-eye3_physioevents.json 
      └─ sub-01_task-visualSearch_recording-eye3_physioevents.tsv.gz

Given the above example file structures, a corresponding sub-01_task-visualSearch_recording-eye1_physio.json sidecar could read:

{
    "DeviceSerialNumber": "17535483",
    "Columns": ["timestamp", "x_coordinate", "y_coordinate", "pupil_size"],
    "Manufacturer": "SR-Research",
    "ManufacturersModelName": "EYELINK II CL v4.56 Aug 18 2010",
    "PhysioType": "eyetrack",
    "RecordedEye": "right",
    "SampleCoordinateSystem": "gaze-on-screen",
    "SamplingFrequency": 1000,
    "SoftwareVersions": "SREB1.10.1630 WIN32 LID:F2AE011 Mod:2017.04.21 15:19 CEST",
    "timestamp": {
        "Description": "a continuously increasing identifier of the sampling time registered by the device",
        "Units": "ms",
        "Origin": "System startup"
    },
    "x_coordinate": {
      "LongName": "Gaze position (x)",
      "Description": "Gaze position x-coordinate of the recorded eye, in the coordinate units specified in the corresponding metadata sidecar.",
      "Units": "pixel"
    },
    "y_coordinate": {
      "LongName": "Gaze position (y)",
      "Description": "Gaze position y-coordinate of the recorded eye, in the coordinate units specified in the corresponding metadata sidecar.",
      "Units": "pixel"
    },
    "pupil_size": {
        "Description": "Pupil area of the recorded eye as calculated by the eye-tracker in arbitrary units (see EyeLink's documentation for conversion).",
        "Units": "arbitrary"
    }
}

Content of sub-01_task-VisualSearch_events.json:

{
   "TaskName": "Visual Search",
   "InstitutionName": "Stanford University",
   "InstitutionAddress": "450 Serra Mall, Stanford, CA 94305-2004, USA",
   "StimulusPresentation": {
       "ScreenDistance": 0.6,
       "ScreenOrigin": ["top", "left"],
       "ScreenRefreshRate": 60,
       "ScreenResolution": [1024, 768],
       "ScreenSize": [0.386, 0.29]
   }
}

Example eye-tracking recording. Given the above sub-01_task-visualSearch_recording-eye1_physio.json metadata specification, the decompressed content of the sub-01_task-visualSearch_recording-eye1_physio.tsv.gz can be:

7186799    416.29    267.39    4612.0
7186800    416.29    268.10    4623.0
7186801    416.20    269.00    4623.0
7186802    415.89    269.60    4613.0
7186803    415.70    269.20    4603.0
7186804    415.60    266.79    4591.0
7186805    415.79    264.60    4589.0
7186806    416.10    263.89    4587.0
7186807    416.29    265.20    4587.0
7186808    416.39    266.50    4588.0
7186809    416.50    266.79    4594.0
7186810    416.50    267.20    4599.0
7186811    416.10    268.00    4609.0
7186812    415.70    268.29    4612.0
7186813    416.00    268.60    4605.0

Example sub-01_task-visualSearch_recording-eye1_physioevents.tsv.gz corresponding to the above eye-tracking recording, after decompressing:

7184392    n/a    n/a         n/a    "NO Reply is disabled for function eyelink_cal_result"
7184392    n/a    n/a         n/a    "RECCFG CR 1000 2 0 R"
7184392    n/a    n/a         n/a    "ELCLCFG TOWER"
7186771    n/a    n/a         n/a    "First task trigger"
7186806    72     fixation    0      n/a
7186879    231    saccade     1      n/a
7187111    6186   fixation    0      n/a
7193298    216    saccade     1      n/a
7193515    1286   fixation    0      n/a
7194802    24     saccade     0      n/a
7194827    2403   fixation    0      n/a
7197231    17     saccade     0      n/a
7197249    1640   fixation    0      n/a
7198890    6      saccade     0      n/a
7198897    1105   fixation    0      n/a
7200003    233    saccade     1      n/a
7200237    184    fixation    0      n/a
7200422    15     saccade     0      n/a
7200438    264    fixation    0      n/a

where the first three rows are logged by the eye-tracker and the fourth row shows a message asynchronously received by the eye-tracker. The remainder of the rows contain a subset of parameters registered by the device, derived from applying a model to identify saccades and blinks. The corresponding sub-01_task-visualSearch_recording-eye1_physioevents.json sidecar would read:

{
    "Columns": ["onset", "duration", "trial_type", "blink", "message"],
    "Description": "Messages logged by the measurement device",
    "OnsetSource": "timestamp",
    "blink": {
      "Description": "Gives status of the eye.",
      "Levels": {
          "0": "Indicates if the eye was open.",
          "1": "Indicates if the eye was closed."
      }
    },
    "message": {
      "Description": "String messages logged by the eye-tracker."
    },
    "trial_type": {
      "Description": "Event type as identified by the eye-tracker's model.",
      "Levels": {
          "fixation": "Indicates a fixation.",
          "saccade": "Indicates a saccade."
      }
    }
}

) }}

Raw Physiological Data

1. File formats and directory structure

1.1 General principles

The file and dataset naming conventions for physiological data follow the common principles of BIDS. When present, physiological recordings SHOULD be stored as compressed tabular files (.tsv.gz format) along with corresponding JSON files for storing metadata fields (see below).

An example of the physio directory structure is shown below:

Template:

sub-<label>/
    [ses-<label>/]
        anat/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
        beh/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
        dwi/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
        eeg/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
        emg/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
        func/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
        ieeg/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
        meg/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
        motion/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
        nirs/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
        perf/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz
        pet/
            <matches>[_recording-<label>]_physio.json
            <matches>[_recording-<label>]_physio.tsv.gz

Legend:

For more information about filename elements (for example, entities, suffixes, extensions), follow the links embedded in the filename template.
<matches> is a placeholder to denote an arbitrary (and valid) sequence of entities and labels at the beginning of the filename (only BIDS "raw").
<source-entities> is a placeholder to denote an arbitrary sequence of entities and labels at the beginning of the filename matching a source file from which the file derives (only BIDS-Derivatives).
Filename entities or directories between square brackets (for example, [_ses-<label>]) are OPTIONAL.
Some entities may only allow specific values, in which case those values are listed in <>, separated by |.
_<suffix> means that there are several (>6) valid suffixes for this filename pattern.
.<extension> means that there are several (>6) valid extensions for this file type.
[.gz] means that both the unzipped and gzipped versions of the extension are valid.

└─ dataset/
   └─ sub-<label>/
      └─ ses-<label>/
         └─ physio/
            ├─ sub-<label>[_ses-<label>]_task-<label>_[recording-<label>]_physio.json 
            └─ sub-<label>[_ses-<label>]_task-<label>_[recording-<label>]_physio.tsv.gz

└─ dataset/
   └─ sub-<label>/
      └─ ses-<label>/
         └─ func/
            ├─ <matches>_[recording-<label>]_physio.json 
            └─ <matches>_[recording-<label>]_physio.tsv.gz

When recording physiological data, we RECOMMEND to always record and save the data with the least amount of processing possible applied to it following this specification. If derivatives are computed in real time, we RECOMMEND to save them following the derivatives BEP, and to also store raw data following this concBEP.

1.2 Splitting concurrently acquired data into multiple files

Recorded physio data MUST be split into separate data files in case of difference in top-level metadata like SamplingFrequency, Software, and Manufacturer of the main recording device (i.e., data source). These top-level metadata are discussed in the following section.

Data with common top-level metadata MAY be kept aggregated in one file otherwise, or split based on channel type, if preferred. The sole exception is eye tracking data, that MUST be split in its own file, following BEP020 specifications.

We generally recommend keeping different files from different recording devices separate, but the option to keep data together acknowledges not only current standards in data collection, but also the fact that often physiological data is inspected and analysed together to get a clearer picture of what the fluctuations describe (e.g., looking at ventilation and respiration together, or PPG and ECG for motion artifacts).

Moreover, the set of metadata we are proposing managed to consider most, if not all, possible channel types - with the exception of eye tracking. Thus, the choice to aggregate physiological data with common key metadata in a single file is left to user preference.

We RECOMMEND to store trigger signals recorded alongside physiological channels in the same file when concurrent modalities are collected (e.g. functional MRI or EEG).

For example:

Splitting recorded data into separate physio data files

└─ sub-001/
   └─ ses-01/
      └─ physio/
         ├─ sub-001_ses-01_recording-scr_physio.json 
         ├─ sub-001_ses-01_recording-scr_physio.tsv.gz 
         ├─ sub-001_ses-01_recording-ecg_physio.json 
         ├─ sub-001_ses-01_recording-ecg_physio.tsv.gz 
         ├─ sub-001_ses-01_recording-resp_physio.json 
         └─ sub-001_ses-01_recording-resp_physio.tsv.gz

Combining recorded data into one pair of physio data files

└─ sub-001/
   └─ ses-01/
      └─ physio/
         ├─ sub-001_ses-01_physio.json 
         └─ sub-001_ses-01_physio.tsv.gz

└─ dataset/
   └─ sub-<label>/
      └─ ses-<label>/
         └─ physio/
            ├─ sub-001_ses-01_physio.json 
            └─ sub-001_ses-01_physio.tsv.gz

It is possible that the recording-<label> entity uses terms that could be confused with metadata field values, such as MeasurementType or SamplingFrequency. In that case, the lowest metadata level available should always be interpreted as the most reliable information. For instance, if the file name contains recording-1000hz but the SamplingFrequency metadata indicates a sampling frequency of 100Hz, data MUST be interpreted as being sampled at 100 Hz. Similarly, if the entity recording-ecg is used, but the MeasurementType metadata of the contained columns indicate “ppg” and “Ventilation”, the data MUST be interpreted as PPG and Ventilation, and not ECG.

2. JSON Data files

Metadata sidecar files (<matches>_physio.json) SHOULD define the field PhysioType. This field indicates a specific type of formatting, rather than a physiological modality. The PhysioType "generic" value, being the default, MUST be assumed if the PhysioType metadata is not defined.

All metadata we are proposing are either OPTIONAL or RECOMMENDED, and they are meant to enrich the current "generic" PhysioType. However, we are also suggesting the introduction of a "specified" PhysioType, that will differ from "generic" because one proposed metadata, MeasureType, will be REQUIRED rather than RECOMMENDED. Equally, the Units metadata will be REQUIRED instead of RECOMMENDED in this case.

Compared to the current BIDS specification (1.10.0), at the file level we are adding one metadata, the OPTIONAL SubjectPosition, indicating the position of the subject during the data collection (see section 2.1).

When specifying column names, columns MUST have unique names. All such data columns MUST be appropriately defined in the JSON metadata.

Example:

{
  "Columns": ["screda1", "screda2", "ecg", "ppg"],
  "SamplingFrequency": 1000,
  "SubjectPosition": "sitting",
  "PhysioType": "specified",
  ...
  "screda1": {
    "MeasureType": "EDA-phasic",
    "Units": "mS",
    "Placement": "Thenar",
    ...
  },
  "screda2": {
    "MeasureType": "EDA-tonic",
    "Units": "mS",
    "Placement": "Hypothenar",
    ...
  },
  "ecg": {
    "MeasureType": "ECG",
    "Units": "mV",
    "Placement": "II",
    ...
  },
  "ppg": {
    "MeasureType": "PPG",
    "Units": "au",
    "Placement": "Right earlobe",
    ...
  },
  ...
}

As described in the following table (Section 2.2), this BEP is adding a few metadata to describe columns.

The most important one is MeasureType, a RECOMMENDED metadata that indicates the actual nature of the data in the column.
- This metadata value is a string that MUST come from a set of keywords (see table 2.2).
- This set of keywords can be expanded in the future to include more physiological modalities.
- When the file-level metadata PhysioType is "specified", MeasureType becomes a REQUIRED field for each column.

This metadata is meant to be the most reliable indicator of the type of data contained in the described column. Having a reliable and standardized indication of what type of data is being handled allows automated modality specific data processing and prevents data misuse.

Furthermore, we are proposing that Units becomes a REQUIRED metadata when PhysioType is "Specified". Not only this helps to better reflect the possible quantitative nature of physiological data, but since similarly labelled data (e.g. Ventilation) can be expressed in different units, indicating different underlying processes, sensors, or levels of real-time preprocessing and data manipulation (e.g. transformation from Volts to millimeters of Mercury), making this field more explicit in the section regarding physiological data will help improve data interpretation. Specification of units SHOULD follow the International System of Units (see BIDS specification).

We are also introducing a Placement RECOMMENDED metadata, that describes the position of the sensor during data collection. For instance, a file could have three columns of ventilation data, one collected at the navel, one at the diaphragm, and one at the armpit level, in which case Placement values would be “Navel”, “Diaphragm”, and “Armpit” respectively. In case the data describes gas concentration, such as CO2 or O2, Placement SHOULD be used to indicate if a “Nose” cannula versus a “Mouth” mouthpiece or a “Mask” was used.

The three metadata at this level describing hardware are:

ChannelManufacturersModelName (RECOMMENDED)
ChannelManufacturers (RECOMMENDED)
ChannelDeviceSerialNumber (OPTIONAL)

These metadata are meant to describe the nature of the equipment used to record data. Different components from different manufacturers could be used at the same time in a “patchwork” approach in which a sensor or amplifier from manufacturer A is connected to the recording device of manufacturer B, and even the same manufacturer could provide two or more options to measure the same type of data. Many setups that differ in this way introduce a potential difference in data processing (e.g. digital vs analogical lags, delays and sharpness of the recording, quantification, …).

Thus, we RECOMMEND to increase the granularity of the setup description for each column, and we RECOMMEND to report names and manufacturers (when different from the main unit) of sensors, connective elements (e.g. cannulae or cables), and amplifiers. Serial numbers MAY be reported as well.

In this framework, it is crucial to distinguish between the different fields available for specifying recording equipment in the meta-data: at the top-level, the main recording device and software are characterized in meta-data fields such as SoftwareModels and DeviceSerialNumber, while at the column-level, information about channel-specific hardware is characterized in meta-data fields such as ChannelDeviceSerialNumber.

We provide the example shown above to assist in determining the main recording device in common physiological acquisition set-ups. In the example shown above, three different recording systems are being used to concurrently acquire physiological data. The first system acquires two channels of physiological data with software A and main recording device ‘a’, which both would be specified using the top-level fields in the accompanying meta-data. Upstream, hardware such as amplifiers, filters, cables, and sensors would be specified using column-level fields specific to each channel in the accompanying meta-data. In the second system, one channel of physiological data is being acquired by main recording device ‘b’ and wirelessly transmitted to software B. In this case, the sensor attached to device ‘b’ can still be specified using column-level meta-data fields if it is an independent product. In the third system, data is acquired by a physiological monitoring unit which is integrated with an MRI scanner (device ‘c’), which itself acts as the main recording device. In case of using networked middleware systems such as the lab streaming layer, where the data may be centrally recorded, the central recording computer itself MAY be considered the main recording device.

Finally, the AmplifierSettings is a dictionary meant to be filled with potential amplifier settings that can manipulate the data collection at the source, e.g. low-pass filters or DC/AC currents. Because each amplifier and each manufacturer have different settings, we cannot define further the content of this dictionary, but we suggest using manufacturer specific pairs of keys and values. In this dictionary, we also SUGGEST reporting eventual data transformations (e.g. the exact formula used to transform gas pressure from measured Voltage to millimetres of Mercury).

More information about the metadata entities contained in the JSON files can be found in the tables below.

2.1 Metadata fields used in top level metadata

Key name	Requirement Level	Data type	Description
SamplingFrequency	REQUIRED	number	Sampling frequency (in Hz) of all the data in the recording, regardless of their type (for example, `2400`).
StartTime	REQUIRED	number	Start time in seconds in relation to the start of acquisition of the first data sample in a reference file sharing the same "ConcurrenceGroup" identifier (negative values are allowed). Within each set of “ConcurrenceGroup” files, a reference file MUST be designated with a “StartTime” equaling 0. This data MAY be specified with sub-second precision using the syntax `s[.000000]`, where `s` reflects whole seconds, and `.000000` reflects OPTIONAL fractional seconds. If no ConcurrenceGroup identifier is defined, the StartTime should be set to 0.
Columns	REQUIRED	array of strings	Names of columns in file.

2.2 Metadata fields for column description

Column name	Requirement Level	Data type	Description
cardiac	OPTIONAL	number	continuous pulse measurement
respiratory	OPTIONAL	number	continuous breathing measurement
trigger	OPTIONAL	number	continuous measurement of the scanner trigger signal
Additional Columns	OPTIONAL	`n/a`	Additional columns are allowed.

2.3 MeasureType descriptions

MeasureType	Name	Description
Trigger	Trigger	Digital (binary TTL) or analog (TTL in Volt) values indicating scanner triggers.
PPG	Photoplethysmography	Continuous optical signal capturing the cardiac pulsation.
ECG	Electrocardiography	Continuous electrical signal capturing the cardiac activity.
Ventilation	Ventilation	Continuous breathing measurement.
CO2	Carbon dioxide	Continuous measurement of the carbon dioxide concentration in expired air.
O2	Oxygen	Continuous measurement of the oxygen concentration from respiratory gases.
PetCO2	End-tidal carbon dioxide	Continuous measurement of the end-tidal pressure of carbon dioxide at the end of an exhalation.
PetO2	End-tidal oxygen	Continuous measurement of the end-tidal pressure of oxygen at the end of an inhalation.
EDA-tonic	Electrodermal activity, tonic component	Continuous measurement of low-frequency changes in electrodermal activity, also known as skin conductance level.
EDA-phasic	Electrodermal activity, phasic component	Continuous measurement of high-frequency changes in electrodermal activity, also known as skin conductance response.
EDA-total	Electrodermal activity	Continuous measurement of the changes in electrical properties of the skin.
BP	Blood pressure	Continuous measurement of the blood pressure waveform representing the changes in arterial pressure over time.
Other	Other	Any other type of channel.