Semantics vs Models

December 24th, 2008

The idea of a semantic web or just in general providing semantics is essentially to augment data with additional data providing more information about what the original data is about (and standardise this to enable communication/interfacing). This is supposed to enable algorithms to process the original data better and in some sense understand that data. Calling this meta data seems to be justified in as far as that the additional data is actually not directly used or useful to humans, but it is intended for the algorithms. Humans can go quite far with interpreting data without this additional information, as far as their knowledge, experience and intelligence actually allows them to.

However, it seems that this means the data provider also has to provide all useful semantic information and that simply seems to be impossible. The data provider can provide this information for the data in the original context, but the data itself may have a far wider use. This would be missed by any algorithm as it would not reinterpret the data in a new, unknown context, but only use the original interpretation(s). More generally it seems to suggest a purely extrinsic notion of understanding independent of the observer. While semantics information may be useful in a very narrowly defined, specific context, a single project or precisely defined subject area, in a wider context it seems hardly achievable or useful.

Instead, a system which builds models (of any sort, not just statistically, even if statistical models seem to be important, especially for human interpretation) based on the data available seems to enable reinterpretation of the data and enable using it in different contexts. The resulting models derived from the data may then still be augmented with semantics information to make them accessible. This would base semantics on actual data instead of trying to make the data fit a particular view. But this is not always necessary or needed, depending simply on the use of the resulting models. Of course a certain bias can still be present depending on the type of models used. It also makes semantics more intrinsic, depending on the observer and how it/she/he builds the model. Many different models may even be build to explore concepts and these models may not be easily translatable into each other, if at all. Of course the question now becomes how to (reasonably quickly) build such models.

This is not just related to the semantic web, but also, say, to interpreting geometric models in the sense of describing it in terms that are meaningful to someone wishing to process (create, edit, analyse, etc.) the model. A fixed description of a model’s design intent in this context via history, geometric constraints, regularities, etc. seems to be restricting in a similar way than providing rather universal semantics to data on the web and does not allow for reinterpretation and with that reuse.

Categories: General | No Comments

A Beginning…

November 22nd, 2008

This notebook or blog is intended to become a collection of notes and ideas based on my work. I still have to see how this actually develops before I can say more. At the moment it is an experiment to see how far publishing raw thoughts, ideas and results is at all useful and practicable. There are also other sites associated with my work relating to more complete results and ongoing projects. Some of these are similarly experimental and undefined while others aim more at making the final results freely accessible.

Ex Tenebris Scientia is my personal home page which provides an overview of my work and some other activities. Most of its contents related to final results and published work, rather than work in progress or initial ideas. It’s also focused on me, rather than on projects/work.

Astarte is a development site based on trac with various version control systems, continuous integration, etc. It is mainly aimed at developing, testing and releasing software. As most of the work I’m doing relates to algorithms and software, most of the ongoing projects and related information is likely to go on this site. The name is based on the name of a rather ambitious project of revising the way I (or maybe we) are using computers by devising a new programming centric framework for using a computer. I will, however, not discuss more details about this in the near future.

X=10Z is a wiki site and also hosts this blog. The wiki site has been created for documentation purposes, but what precisely this means I still have to define. One idea at the moment is to put material relating to the courses I’m teaching there to develop something like a textbook. Research results and related information may be documented there in a similar way. Associated with this sites are also blogs, like this one (currently the only one), to keep notes, etc. related to a person, project or topic.

Beyond that I’ll have to see how things are developing. Being busy with loads of things I do not expect to quickly provide a lot of content for any of this.

One final note, in particular for this blog: the content here may be messy, unorganised, incomplete or simply wrong. So do not hold me responsible for any of these comments… I’ll check the content elsewhere more carefully than here. But feel free to comment on anything…

Categories: General | No Comments