datasetjson - Read and write CDISC Dataset JSON formatted datasets in R and Python
R/Pharma 2025 Workshop
2025-11-07
Convert XPT files using defaults (NDJSON format):
Convert SAS7BDAT files to JSON format:
Convert Dataset-JSON to XPT
from dsjconvert import XPTConverter, MetadataExtractor
# With Define-XML metadata
extractor = MetadataExtractor('path/to/define.xml')
converter = XPTConverter(
metadata_extractor=extractor,
output_format='ndjson',
skip_validation=True
)
converter.convert_dataset('input.xpt', 'output_dir')
# Without Define-XML (auto-infer metadata)
converter = XPTConverter(output_format='ndjson')
converter.convert_dataset('input.xpt', 'output_dir')Convert multiple SAS7BDAT files to JSON format:
import os
from dsjconvert import SAS7BDATConverter
converter = SAS7BDATConverter(output_format='json')
# Get all SAS files
sas_dir = 'data'
sas_files = [f for f in os.listdir(sas_dir) if f.endswith('.sas7bdat')]
# Convert each file
for sas_file in sas_files:
input_path = os.path.join(sas_dir, sas_file)
output_path = converter.convert_dataset(input_path, 'output')
print(f"Converted: {output_path}")Convert Dataset-JSON to SAS XPT
exercises/02-python.py and work through the exercises