How to drive odoc

This 'live' document describes how to use odoc to produce the documentation of odoc itself. The aim is to show a short, simple example of how odoc can be used, covering most of the important features. The document built here includes not only the documentation of odoc itself, but also builds the docs for a subset of odoc's dependent libraries to show how this may be done. For a much more complete and comprehensive use of odoc, see the voodoo project, the tool that will be used to build docs.ocaml.org.

First we need to initialise mdx with some libraries and helpful values.

ocaml env=e1
(* Prelude *)
#require "bos";;
#install_printer Fpath.pp;;
open Bos;;
let (>>=) = Result.bind;;
let (>>|=) m f = m >>= fun x -> Ok (f x);;

Desired output

Odoc produces output files \(html or others\) in a structured directory tree, and so before running odoc the structure of the output must be decided. For these docs, we want the following structure:

The odoc model for achieving this is that we have pages \(mld files\) that have children that are either further pages \(mld files\) or modules \(from cmti files\). This parent/child relationship is specified on the command line. Parent pages must be compiled by odoc before their children, and compiling a page mypage.mld will produce the file page-mypage.odoc.

For example, in this case there will be a file odoc.mld that corresponds with the top-level directory odoc, and will be compiled as follows:

odoc compile odoc.mld --child page-odoc_model --child page-odoc_parser --child deps ...

and a file deps.mld that corresponds with the sub-directory odoc/deps, which will be compiled as follows:

odoc compile deps.mld -I . --parent odoc --child page-stdlib --child page-astring ...

the file odoc_model.mld will have a child module Odoc_model and will be compiled as follows:

odoc compile odoc_model.mld -I . --parent odoc --child module-Odoc_model

When compiling any mld file, the parent and all children must be specified. Parents can only be pages from other mld files, and children may be pages \(from mld files\) or modules \(from cmti/cmt or cmi files\).

The parent page must exist before the child page is created, and must have specified the child when it itself was compiled.

Document generation phases

Using odoc is a three-phase process.

    This takes the output from the compiler in the form of cmti, cmt, or cmi files \(in order of preference\), translates it into odoc's internal format, and performs some initial expansion and resolution operations. For a given input /path/to/file.cmti it will output the file /path/to/file.odoc unless the -o option is used to override the output file. If there were cmi dependencies required for OCaml to compile these files, then there will be equivalent odoc dependencies required to perform the odoc compile step. Odoc will search for these dependencies in the paths specified with the -I directive on compilation. Odoc provides a command to help with this: odoc compile-deps:

    As an example we can run odoc compile-deps on the file ../src/xref2/.odoc_xref2.objs/byte/odoc_xref2__Compile.cmti:

    $ odoc compile-deps ../src/xref2/.odoc_xref2.objs/byte/odoc_xref2__Compile.cmti | tail -n 5
    Stdlib__result 2ba42445465981713146b97d5e185dd5
    Stdlib__seq d6a8de25c9eecf5ae9420a9f3f8b2e88
    Stdlib__set 5d365647a10f75c22f2b045a867b4d3e
    Stdlib__uchar ab6f1df93abf9e800a3e0d1543523c96
    Odoc_xref2__Compile e0d620d652a724705f7ed620dfe07be0

    so we can see we will need to run odoc compile against several Stdlib modules before we can compile odoc_xref2__Compile.cmti

      This takes the odoc files produced during the compilation step performs the final steps of expansion and resolution. It is during this phase that all of the references in the documentation comments are resolved. In order for these to be resolved, everything that is referenced must have been compiled already, and their odoc files must be on the include path as specified by the -I arguments to odoc link. In this example, we achieve that by compiling all modules and mlds before linking anything. The output of the link step is an odocl file, by default in the same path as the original odoc file.

      Note that is only necessary to link the non-hidden modules \(without a double underscore\).

        Once the compile and link phases are complete, the resulting odocl files may be rendered in a variety of formats. In this example we output HTML.

        Odoc documentation

        In this section odoc is used to generate the documentation of odoc and some of its dependent packages. We can make a few simplifying assumptions here:

          We start with some functions to execute the three phases of odoc.

          Compiling a file with odoc requires a few arguments: the file to compile, an optional parent, a list of include paths, a list of children for mld files, and an output path. Include paths can be just '.', and we can calculate the output file from the input because all of the files are going into the same directory.

          Linking a file with odoc requires the input file and a list of include paths. As for compile we will hard-code the include path.

          Generating the HTML requires the input odocl file and an output path. We will hard-code the output path to be html.

          In all of these we'll ignore stderr.

          ocaml env=e1
          let odoc = Cmd.v "odoc";;
          
          let compile file ?parent children =
              let output_file =
                  let ext = Fpath.get_ext file in
                  let basename = Fpath.basename (Fpath.rem_ext file) in
                  match ext with
                  | ".mld" -> "page-"^basename^".odoc"
                  | ".cmt" | ".cmti" | ".cmi" -> basename^".odoc"
                  | _ -> failwith ("bad extension: " ^ ext)
              in
              let open Cmd in
              let cmd =
                  odoc % "compile" % Fpath.to_string file % "-I" % "." % "-o" % output_file |>
                  List.fold_right (fun child cmd -> cmd % "--child" % child) children
              in
              let cmd =
                  match parent with 
                  | Some p -> cmd % "--parent" % p
                  | None -> cmd
              in
              OS.Cmd.(run_out ~err:err_null cmd |> to_lines) |> Result.get_ok
          
          let link file =
              let open Cmd in
              let cmd = odoc % "link" % p file % "-I" % "." in
              OS.Cmd.(run_out ~err:err_null cmd |> to_lines) |> Result.get_ok
          
          let html_generate file =
              let open Cmd in
              let cmd = odoc % "html-generate" % p file % "-o" % "html" in
              OS.Cmd.(run_out cmd ~err:err_null |> to_lines) |> Result.get_ok
          
          let support_files () =
              let open Cmd in
              let cmd = odoc % "support-files" % "-o" % "html" in
              OS.Cmd.(run_out cmd |> to_lines) |> Result.get_ok

          We'll now make some library lists. We have not only external dependency libraries, but Odoc itself is separated into libraries too. These two sets of libraries will be documented in different sections, so we'll keep them in separate lists, together with a We start by declaring a few lists representing these sections as well as a list mapping the section to its parent matching to the hierarchy declared above.

          ocaml env=e1
          let dep_libraries = [
              "stdlib";
              "astring";
              "cmdliner";
              "fpath";
              "result";
              "yojson";
              "tyxml";
          ];;
          
          let odoc_libraries = [
              "odoc_xref_test"; "print"; "odoc_xref2"; "odoc_parser"; "odoc_odoc";
              "odoc_model_desc"; "odoc_model"; "odoc_manpage"; "odoc_loader";
              "odoc_latex"; "odoc_html"; "odoc_document"; "odoc_compat" ];;
          
          let all_libraries = dep_libraries @ odoc_libraries;;
          
          let extra_docs = [
              "interface"
          ]
          
          let parents =
              let add_parent p l = List.map (fun lib -> (lib, p)) l in
              (add_parent "deps" dep_libraries) @ (add_parent "odoc" odoc_libraries);;

          Odoc operates on the compiler outputs. We need to find them for both the files compiled by dune within this project and those in libraries we compile against. The following uses ocamlfind to locate the library paths we're looking in for our dependencies:

          ocaml env=e1
          let ocamlfind = Cmd.v "ocamlfind";;
          
          let lib_path lib =
              let cmd = Cmd.(ocamlfind % "query" % lib) in
              OS.Cmd.(run_out cmd |> to_lines >>|= List.hd);;
          
          let lib_paths =
              List.fold_right (fun lib acc ->
                  acc >>= fun acc -> lib_path lib >>|= fun l -> (lib, l) :: acc) dep_libraries (Ok []) |> Result.get_ok

          We need a function to find odoc inputs given a search path. The files that odoc operates on are cmti, cmt or cmi files, in order of preference, and the following function finds all files like that given a search path, returning an Fpath.Set.t containing Fpath.t values representing the absolute path to the file without its extension.

          ocaml env=e1
          let find_units p =
              OS.Dir.fold_contents ~dotfiles:true
                  (fun p acc ->
                      if List.exists (fun ext -> Fpath.has_ext ext p) ["cmt";"cmti";"cmi"]
                      then p::acc
                      else acc)
                  []
                  (Fpath.v p)
              >>|= fun paths ->
              let l = List.map Fpath.rem_ext paths in
              List.fold_right Fpath.Set.add l Fpath.Set.empty;;

          Since the units returned by this function have their extension stripped, we need function to find the best file to use given this basename.

          ocaml env=e1
          let best_file base =
              List.map (fun ext -> Fpath.add_ext ext base) ["cmti";"cmt";"cmi"] |> List.find (fun f -> Bos.OS.File.exists f |> Result.get_ok) 

          Many of the units will be 'hidden' \-- that is, their name will be mangled by dune in order to namespace them. This is achieved by prefixing the namespace module and a double underscore, so we can tell by the existence of a double underscore that a module is intended to be hidden. The following predicate tests for that condition:

          ocaml env=e1
          let is_hidden path =
              Astring.String.is_infix ~affix:"__" (Fpath.to_string path)

          To build the documentation, we start with these files. We'll call odoc compile-deps on the file to find out which other compilation units it depends upon, with the following function:

          ocaml env=e1
          type compile_deps = {
              digest : Digest.t;
              deps : (string * Digest.t) list;
          }
          let compile_deps f =
              let cmd = Cmd.(odoc % "compile-deps" % Fpath.to_string f) in
              OS.Cmd.(run_out cmd |> to_lines) >>|=
              List.filter_map (Astring.String.cut ~sep:" ") >>= fun l ->
              let basename = Fpath.(basename (f |> rem_ext)) |> String.capitalize_ascii in
              match List.partition (fun (n, _) -> basename = n) l with
              | [ (_, digest) ] , deps -> Ok {digest; deps}
              | _ -> Error (`Msg "odd")

          Let's now put together a list of all possible modules. We'll keep track of which library they're in, and whether that library is a part of odoc or a dependency library.

          ocaml env=e1
          let odoc_all_units = find_units ".." |> Result.get_ok;;
          
          let odoc_deps = List.map (fun lib ->
              Fpath.Set.fold (fun p acc ->
                  if Astring.String.is_infix ~affix:lib (Fpath.to_string p)
                  then ("odoc",lib,p)::acc
                  else acc) odoc_all_units []) odoc_libraries;;
          
          let lib_deps = List.map (fun (lib, p) ->
              Fpath.Set.fold (fun p acc ->
                  ("deps",lib,p)::acc) (find_units p |> Result.get_ok) []) lib_paths;;
          
          let all_deps = (odoc_deps @ lib_deps) |> List.flatten;;

          Let's compile all of the parent mld files. We have in here

          ocaml env=e1
          let compile_mlds () =
              let mkpage x = "page-" ^ x in
              let mkmod x = "module-" ^ x in
              let mkmld x = Fpath.(add_ext "mld" (v x)) in
              compile (mkmld "odoc") ("page-deps" :: (List.map mkpage (odoc_libraries @ extra_docs)));
              compile (mkmld "deps") ~parent:"odoc" (List.map mkpage dep_libraries);
              let extra_odocs = List.map (fun p ->
                  ignore(compile (mkmld p) ~parent:"odoc" []);
                  "page-"^p^".odoc") extra_docs in
              let odocs = List.map (fun library ->
                  let parent = List.assoc library parents in
                  let children = List.filter_map (fun (parent, lib, child) -> if lib=library then Some (Fpath.basename child |> mkmod) else None) all_deps in
                  compile (mkmld library) ~parent children;
                  "page-"^library^".odoc"
                  ) all_libraries in
              List.map Fpath.v ("page-odoc.odoc" :: "page-deps.odoc" ::  odocs @ extra_odocs)
          ocaml env=e1
          
          let compile_all () =
              let mld_odocs = compile_mlds () in
              let deps = Hashtbl.create 100 in
              let rec rec_compile lib file =
                  let output = Fpath.(base (set_ext "odoc" file)) in
                  if OS.File.exists output |> Result.get_ok
                  then []
                  else begin
                      let deps = compile_deps file |> Result.get_ok in
                      let files = List.fold_left (fun acc (dep_name, digest) ->
                          match List.find_opt (fun (_,_,f) -> Fpath.basename f |> String.capitalize_ascii = dep_name) all_deps with
                          | None -> []
                          | Some (_,lib,dep_path) ->
                              let file = best_file dep_path in
                              (rec_compile lib file) @ acc
                      ) [] deps.deps in
                      ignore(compile file ~parent:lib []);
                      output :: files
                  end
              in
              List.fold_left (fun acc (parent, lib, dep) ->
                  acc @ rec_compile lib (best_file dep)) [] all_deps
                  (* (List.flatten odoc_deps) *)
              @ mld_odocs
          
          let link_all odoc_files =
              let not_hidden f = not (is_hidden f) in
              List.map (fun odoc_file ->
                      ignore(link odoc_file);
                      Fpath.set_ext "odocl" odoc_file)
                  (List.filter not_hidden odoc_files)
          
          let generate_all odocl_files =
              List.iter (fun f -> ignore(html_generate f)) odocl_files;
              support_files ()
          ocaml env=e1
          # compile_all () |> link_all |> generate_all;;
          - : string list = []