Here are some loose facts in no particular order:
- Courses are imported and exported as tar.gz files
- A typical Open edX course has the following hierarchy 3. sections 4. subsections 5. units 6. videos, html files, exercises
- files are named as a hash, but if the two occurrences of a hash are replaced by a common, unique name the whole directory will still generate a valid course
- The XML structure obeys a hierarchy that is parallel to the course structure: 3. course.xml 4. course directory - determines section order 4. chapter directory - contains sections and determines subsection order 5. sequential directory - contains subsections and determines unit order 6. vertical directory - contains units
- course.xml contains name of course directory
- Course directory contains a file with the course structure and provides references to individual files in the chapter directory 4. changing sequence in the file within the course directory will change the sequence of sections
- Chapter directory contain files describing each course section. 5. Not sure why this directory is called "chapter" rather than section, but its name cannot be changed. 6. Each section file within the chapter directory is named as a hash and contains a reference to a file in the sequential directory 7. Each section file contains a date when the section should be made public, in format 2016-01-01T00:00:00Z
- Sequential directory contains files descring a subsection. 6. each file in the sequential directory is named as a hash and points to a file in the vertical directory
- each file in the vertical directory describes a unit