Difference between revisions of "Data interchange format"

From ECHSA Congenital Database
(Link to new article)
(Advantages of the XML format)

Revision as of 19:38, 20 June 2006

The fundamental format for data exchange in EACTS Database Software is XML.

Format Specification

This is a DTD (Document Type Definition) which can be used to validate an XML file.

This DTD file defines the relations between patients, admissions, operations, etc. in the export file.

If you want to see the codes of diagnoses, procedures, complications, nca and gprf, download the software, perform a complete export to XML and examine the file.

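All dictionaries (with factor codes) can be found in the source code. You can also use a direct link to the dictionaries file: http://www.eactscongenitaldb.org/viewcvs/ecdbc/trunk/DataCollectionSoftware/structure/Dictionaries.xml?root=ecbdc&view=markup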

Why the XML-based format?

We want to standardize the data submission procedure using the portable XML format. It is implemented both in our data collection software and on the server. XML is more expressive than comma-delimited files because it reflects the actual structure of the data. For example:

<Patient>
  <Admission>
    <Operation/>
    <Operation/>
  </Admission>
</Patient>

This shows how the patient, admission and operation objects are related to each other: Admission is nested inside Patient, and Operation inside Admission. Describing the data this way is much less error-prone than using CSV files.
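
The difference can be made concrete with a small sketch. It parses the example above with Python's standard xml.etree.ElementTree module and flattens it into CSV rows. The flatten function and the column names "admission" and "operation" are hypothetical, chosen for illustration only; the position numbers stand in for whatever keys the real export uses.

```python
import csv
import io
import xml.etree.ElementTree as ET

# The nesting example from the text, unchanged.
XML_TEXT = """\
<Patient>
  <Admission>
    <Operation/>
    <Operation/>
  </Admission>
</Patient>
"""


def flatten(patient):
    """Return one row per Operation, numbering records by position.

    The position numbers are an illustrative stand-in for real keys;
    the actual export format may identify records differently.
    """
    rows = []
    for a_idx, admission in enumerate(patient.findall("Admission"), start=1):
        for o_idx, _operation in enumerate(admission.findall("Operation"), start=1):
            rows.append({"admission": a_idx, "operation": o_idx})
    return rows


patient = ET.fromstring(XML_TEXT)
rows = flatten(patient)

# In XML the parent of each Operation is implied by nesting. In CSV the
# admission number has to be repeated on every row to keep that link.
buffer = io.StringIO()
writer = csv.DictWriter(
    buffer, fieldnames=["admission", "operation"], lineterminator="\n"
)
writer.writeheader()
writer.writerows(rows)
csv_text = buffer.getvalue()
print(csv_text)
```

In the flat form, the link between an operation and its admission survives only as a repeated column value, which is exactly the kind of bookkeeping that goes wrong when CSV files are edited or merged by hand.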

Advantages of the XML format

  • Reflects the structure of the data (tables do not)
  • Self-describing: you can see the name of each field
  • Transparent, can be opened with a text editor
  • Less error-prone than CSV files
  • Portable, can transfer data across many software systems
  • Solves national character encoding problems (by declaring the encoding)
  • Widely known
  • Can be transformed, using an XSLT stylesheet, into any other text-based format such as HTML, RTF, CSV, LaTeX, or a custom format.
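
The point about character encodings can be illustrated with a minimal sketch, again using Python's standard xml.etree.ElementTree module. The "name" attribute and its value are hypothetical and are not taken from the actual export format; ISO-8859-1 is just one example of a national encoding.

```python
import xml.etree.ElementTree as ET

# Hypothetical attribute name and value, used only for illustration.
original_name = "Jürgen Müller"
patient = ET.Element("Patient", {"name": original_name})

# Serialize with an explicit encoding declaration in the XML prolog.
data = ET.tostring(patient, encoding="iso-8859-1", xml_declaration=True)

# The parser reads the declaration and decodes the bytes accordingly,
# so the receiving side does not have to guess the encoding.
restored = ET.fromstring(data)

print(data.splitlines()[0].decode("ascii"))
print(restored.get("name"))
```

A CSV file carries no such declaration, so the receiver has to know or guess the encoding in which it was written.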

Export to CSV

see article: Export to CSV

Import from other databases

see article: Import from other databases.

See also