Unlike other fields which perform comprehensive diagnostics or characterization prior to analysis, the evaluation of patient representation learning methods has largely been limited to the context of a specific downstream use case and is usually performed as part of model interpretation. While it cannot solve all of the aforementioned challenges, data-driven characterization of patient representations, independent of model development, may provide invaluable and unexpected insight and is an important first step towards understanding if these methods can be used to help automate CP development. To this end, we sought to answer the following questions:
RSQ1: What combinations of data type and sampling window create the best patient representations and does performance differ by disease group?
RSQ2: How does data-driven characterization of patient representation impact the explainability of downstream tasks like clustering?
To address these que

