Modelling the domain in 4 steps
A note on the modelling steps: my first attempt at modelling the domain took me quite far into the process, up to step 3, and so I spent quite a lot of time going back and forth between the various stages as well as the metadata analysis and alignment, which might mean there's a few inconsistencies here and there.
The color legends for the first 3 steps are:
- Subject
- Item
- Entity with additional relationships
All images can be clicked through for full resolution and downloaded as svg files.
Step 1 - Conceptual Map
The first modelling step was an attempt at explaining the scenario by representing the real data from the chosen items alongside the explicit relationships.
The main takeaway from this for me was that the chosen items could allow for a variety of connections both within and without the domain. This was further expanded upon in later steps while keeping it focused to entities that were most relevant to this particular snapshot of Yancey's life and death.
Step 2 - E/R Model
The next step involved translating the conceptual map into an Entity/Relationship model. Early attempts at this produced a large model that became unwieldy in its representation. As such I ultimately decided to use yED's inbuilt E/R icons, including the entity with attribute icon, to group attributes without additional relationships under the item, thus making the model a little clearer while also bringing forward the depth of interconnections available.
Step 3 - Theoretical Model
We then move to the theoretical model, which essentially translates the E/R model into a first level of abstraction. This was also the stage at which we were asked to think about additional data to add to the models, but as mentioned above I had already started doing this at an earlier stage and very much realised I had to stop myself if I wanted to keep this manageable (and also because I had to go back and fix a bunch of things!).
In terms of the who/where/when/what questions that the model had to answer here are some of the notes from the process:
- Who: I added some properties to the person at the center of the domain (birth, death, cause of death) which were connected to other entities/items. I also tried to reflect as many of the interpersonal relations that exist/ed between agents such as Yancey, his mother, and the labels and people he worked with and who documented him
- Where: This is mainly about geographical locations as well as holding institutions. The trickiest one was the idea of place as related to the beginning and end of Yancey's career as represented by the discographies and detailed in the MusicBrainz metadata of the pages
- When: This was the most limited range. There’s really two data types for this: single dates (years and a specific day in case of the event) and time-spans (date ranges). All dates relate in some way to either creation, production, or publication
- What: In contrast to the above, this was the range with the widest potential (alongside Who if that were to be expanded beyond the boundaries of the domain's snapshot). I chose to focus that question on item types, physical and topical descriptions, connections between items (such as an item being included in the scope of another, or related to it topically etc...)
Step 4 - Conceptual Model
The fourth and final modelling step was to take the theoretical model and refine the abstraction of the domain using existing ontologies and vocabularies. This was done using the standards highlighted in the analysis/alignment phase as well as some additional generic and specific ones such as:
- rdf/s: to show some of the class structures and item labels, for ease of reading
- foaf, wd, xsd, and ti: to help simplify some property descriptions like subject of and time spans
- mo, pro: to help describe production/publishing relationships which were quite unwieldy to do in CIDOC-CRM/FRBROO
I did my best to try and keep the model as simple as possible while still reflecting all the possible details and relationships that were included in previous stages. I think the final result once again shows the potential depth of the domain and how intricate it could become if we went past the additional entities I limited myself to or if we were able to use items with additional qualities (like for example some personal recordings Yancey made for himself, records he owned etc...). I'll return to these last two ideas in my conclusion.