>
Topic Maps Release : February
2001 - Paper presented at : Knowledge
Technologies 2001 Abstract Contents Introduction
: Managing Complexity Introduction : Managing Complexity Complexity is the main issue of many information and knowledge environments, and the Web itself, chaotic representation of the real world, seems to achieve the complexity paradigm. Many definitions of complexity have been proposed, from the physicists theories of non-linear phenomena, to linguistics and social sciences descriptions. [1] From an information and knowledge management viewpoint, a definition of complexity will include at least the following characteristics:
The Topic Map approach provides tools to tackle these specific issues. Managing complexity is in fact what Topic Map technology is all about. The proposed methodology is therefore mainly addressing representation and management of information in such complex environments. Corporate R&D or Administration Intranets, Information or Business Web Portals are some among many application fields of such a methodology, not to mention the potential development of educational tools in various fields, from Social Sciences to Biology, Health and Environment Technologies, wherever classical knowledge representations are unable to manage the complexity issues. Trying to build a Topic Map from
scratch, as a representation of the complex real world - or at least the
part of it the TM engineer is interested to represent - will lead most
surely to hard and long ontological debate about the nature of inhabitants
of this "real world". A more pragmatic approach is to consider the set of
all available information sources - addressable resources - as a
satisfying proxy, given it generally already contains many representations
of, and documentation on just about every object or subject in the
environment. The very abundance of these resources and the need to manage
them is in fact the main concern of the TM engineer in complex and
multidimensional information environments. A survey and good knowledge of
what is available and pertinent, what is stable or volatile at this ground
floor, should be therefore the first fundamental task for the TM engineer.
The second step following the
above survey is : What are those Resources about? First they give
information about Identified Objects, that is objects well defined
by some non-ambiguous public identifier(s) : People, Companies,
Trademarks, Catalog References, Laws and Regulations, Indexed Books or
Musical Works, Records, Standards, Administrative and Geographical
Subdivisions, past or present, known by some individual name, reference
number in some catalog, index, directory. To represent knowledge about the
above Objects and Resources, the TM engineer will need the vocabulary and
ontology living at the conceptual level : types, categories,
representations, ideas, relationships, whatever cannot be defined as an
Object in the above defined acceptation. There is no absolute reference at
that level, and construction of sharable concepts will always be the
result of some agreement, inside a given community or context, on the
sense of a given word or expression. 4. Classes and Types : Entangled hierarchies Even if there is no universal ontology, among widely shared concepts is the hierarchical relation, appearing under various appellations : category/subcategory, set/subset, class/subclass, with their variants type/subtype, whole/part, parent/child. These hierarchies are non ambiguous and mostly sharable when applied to sets of clearly identified and defined objects (or living creatures), and they are known there as taxonomies : Botanic and Zoology have produced the most famous and efficient examples of this method. But efficiency of a tool is always limited to a definite range. Pertinence of a taxonomic classification method is linked to some peculiar features of the application field.
Hence the hierarchical classification, which has some reasonable claim to universal consistency, since all ornithologists and bird watchers will agree on it :
Another consistent example will be - for those who care more about bikes than birdies :
In this example, it’s interesting to note that "Harley-Davidson" is implicitly considered as a type of motorcycles, in a taxonomy quite similar to the ornithologist’s. But now consider the following:
A more administrative view … but
consistent all the same … There "Harley-Davidson" is considered as the
company located in Milwaukee, Wisconsin, USA. The regional names are
shorthand for "US Companies" "Wisconsin Companies" and "Milwaukee
Companies". The two above hierarchies carry somehow completely independent
viewpoints on the same subject. But is it the same subject when the biker
asks : "Hey, where did you find that fantastic Harley-Davidson?", whereas
the statistician or manager asks : "Where can I get the Harley-Davidson
sales figures for the year 1999?" … difficult question …probably it’s not
really the same subject, but they are somehow aspects of the same one and
could not be made distinct in some resources. 5. Building Knowledge by multi-level Semantic Associations Consistent Topic Maps may be limited to the above defined elements : representation of resources, identification of objects, and definition of classes and entangled hierarchies. In such representations, that one would maybe tempted to call "objective", concepts are only used for definition of types. But the TM engineer will want, in most cases, to build a more refined knowledge representation, and for that include meaningful or semantic relationships. These relationships will use concepts to define roles in basically non-hierarchical associations, either between objects, or between concepts, or between assertions about the former or the later. Below are listed some out of many possible examples of these semantic associations, each of them creating a level of representation.
… "partnership" to define the relationship between two companies … "illustration" for the relationship between a book and its illustrator … "usage" for the relationship between a research center and an instrument aboard a satellite … or to take back again the above example, an association "starring" may link "Harley-Davidson", "Peter Fonda" and "Easy Rider" Topics in some Topic Map of American Popular Culture. Those non-hierarchical associations between objects will make available new directions of navigation, orthogonal to the hierarchical ones. In fact, they are what will make the real different flavor for end users between a classical hierarchical directory and a Topic Map directory.
In the environment and information system, both humans and system parts play functional roles, by which they are related to each other, and to the concepts involved in these functions. Reification of each function by a conceptual Topic, and associations linking this Topic to people and/or system parts or tools managing the function, builds a functional level of representation.
In a dictionary, terms both define and are defined by one another in a recursive way, and in fact, dictionaries are resources ready-made to be represented as Topic Maps through "definition" and "example" associations, and are one of many promising field for TM development. This level may be useful for proper definition of technical terms in a defined context. It may be an extract from some existing dictionary, reorganized in a TM form, or a built-in proper resource. That’s what has been done for the Mondeca Topic Navigator Help, in which a "micro" Topic Map has been built to present the Topic Maps terms and concepts, and functions of the software.
An association being reified, that is being made a subject of a Topic, further associations can represent viewpoints on this subject : agreement, discussion, commentary, expanding, refutation of any of those etc. In our biker example, the association "Easy Rider" – "Peter Fonda" – "Harley-Davidson" , being some sort of "cultural subject" in itself, may be associated to movie critics, sociology reviews articles … through such associations. This type of level would be very useful in the TM representation of on line Forum archives, allowing a more effective search for viewpoints than the classical "thread" organization. 6. Human Management : Administration, Authoring and Updating The problem of occurrences selection and updating may present very different features, depending on the scope of the represented environment and system. Hence the methodology should adapt to these differences, the main one being between closed and open systems. In a closed system like e.g. a corporate intranet, where all occurrences of Topics will be internal resources, TM administrator(s) should get some control or at least get streaming information on what is going on at the ground level of resources, such as who is responsible of which resources, on what time basis these resources are updated, what are the foreseeable changes in the system addresses structure etc. If a TM structure is chosen to represent such an intranet, the final objective should be to implement in the system capacities for the TM to take into account whatever changes occur in the resources, with minimal human intervention, in the ideal case confined to the system upper management. Updating of corporate resources at ground level should not imply extra work for the TM management team, as far as the addresses structure of Intranet resources are not changed. Choosing a TM management should even lead to more system stability, and "semantic" definition of resources. In an open system such as a web portal, updating of occurrences is a more tricky work, and will need more human investment. The success of the Open Directory Project [7], reaching the first rank among Web Directories, both in quantity and quality of content by involving in barely two years more than 30,000 contributors to manage more than 300,000 categories, yields fairly good evidence that as far as surveying the Web for pertinent and updated resources in a given field, "Humans do it better". That comment on the success of ODP does not change the above remarks on the incapacity of its very hierarchical structure to deal with complexity. But the generalization of RDF [8] or other kind of semantic metadata [9] may change the problem in the future. The vision of future Web portals will certainly include large Topic Map structures linked to powerful semantic query engines. 7. Mondeca Topic Navigator Documentation and the Semantopic Universe The above methodology has been evaluated and refined along with development of some Mondeca Topic Navigator applications. Two fairly different situations have been addressed :
This is an example of somehow simple and closed environment. The objective level is there limited to resources created on purpose, in the form of plain HTML pages, each describing either a general concept in Topic Maps terminology, or a particular concept used by the tool, or software function, either an example illustrating one of the above. Each resource is represented one-to-one by a Topic in the Topic Map. Three levels of associations have been created : "Definition" associations link concepts to concepts, like in a dictionary. "Management" associations link functions to the concepts they manage. "Illustration" associations link examples to the concepts they illustrate. To link this documentation to a more complex use case, examples have been chosen in the following Semantopic Universe
This is a very open environment :
the goal is to build a representation and communication tool for "what is
going on" in the loosely defined community working towards the "new web
paradigm". This has been summed up in the title, "universe" meaning this
environment is open and hopefully in expansion, and the term "semantopic"
forged out a coalescence of "semantic" and "topic". Developed first as a tool for knowledge representation and resources indexing, the Topic Map technology seems bound to become a more pervasive tool, pertinent wherever one needs to tackle complexity. Given the technology is still young, it has to be confronted to a variety of use cases in order to be refined and validated. The present applications development has focused mainly on resources indexing and somehow static knowledge representation. The capacity of the paradigm to get to a really central role in open complex systems will certainly be linked to the development of interactive tools allowing effective human management in dynamic collaborative environments, including the upper level of discourse associations such as agreement, disagreement, example, development, perspective, generalization … That will be the next important challenge to consider, at a time where building collective intelligence, corporate or community knowledge, are considered to be among the main economic and social issues for the years to come. Building such tools will need there again specific implementations, but the strong Topic Map conceptual core seems in good position to play a central paradigm role in those predictable developments. [1] Resource links on complexity : http://www.brint.com/Systems.htm [2] Resource links on fractals : http://mathforum.org/library/topics/fractals/ [3] Cycorp Ontology : http://www.cyc.com/ [4] Standard Upper Ontology, an IEEE project : http://suo.ieee.org/ [5] Bringing Knowledge Technologies to the Classrom : http://www.thinkalong.com/JP/KT2001.pdf [6] Dewey’s Decimal Classification : http://www.oclc.org/dewey/about/about_the_ddc.htm [7] Open Directory Projet : http://dmoz.org/ [8] Resource Description Framework : http://www.w3.org/TR/REC-rdf-syntax [9] See for instance the Dublin Core Initiative : http://purl.org/dc |
© 2001 univers
immedia |