1. Introduction

    Search engines, like the popular Google, indexing billions of web resources, on the one hand, people and organisations publishing on the web as many pages as they can in order to be known as broadly as possible, on the other hand, both build the wrong idea that any information or knowledge should be shared publicly.
    The real world is less ideal: a large amount of valuable information or useful knowledge is neither directly accessible by anonymous readers nor by web crawlers. Portals limiting access to registered users and databases giving answers only to well formed queries are the main technical impediments to free browsing or crawling. They are many good economic and social reasons for that.
    The real world wide web is then already divided between a "surface web" and a "deep web". See http://en.wikipedia.org/wiki/Invisible_web. The former serves interchanges between identified partners, the later helps to contact new partners. Even between identified partners all information or knowledge is obviously not be shared totally. Moreover the degree of sharing should be adjustable according to reciprocal trust which could vary in the lifetime of partners.
    Our work which encompass database, portal and weblog technologies for p2p networks, wants to address this requirement. In part 2 we present a model we call "iceberg visibility" to specify views on hierarchies of categories that each pair could use to semantically index web resources of his/her own and to define what he/she wants to share with which other (group of) pair. In part 3 we present a web application that demonstrates the simplicity and the utility of the model for a personalised and privacy preserving weblogging.

2. The « iceberg visibility » model

    We would like to deal with publishers who do not compete firstly for the broadest public – popularity – or market share – sales – but for faithful readers they eventually consider as partners for the high exchange and communication rate – mutual quotations, comments and appraisals – established with. 
     Such publishers try to keep in mind distinctions between “the public” and qualified groups of readers and between groups of readers and named – identified – readers. This groups/persons hierarchy should be dynamically set up and updated according to experience of trust in publisher-reader relationship.

    So the first object they will require should be a freely partitionable directory groups/readers per publisher with the following data structure and methods :

    The second object, that “selective” publishers would need, is a “topic map” to index their pieces of publication – short pages called  hereafter “posts”. One could dream, for this map, of a graphical ontology with many semantic links between topics. Not only it would be complex to manage and display but, what is worse, it will prevent reader for an easy top-down recognition and browsing. Even if a publisher, for an expert usage, would rather deal with a graph structure, he or she should refrain himself or herself in order to offer readers firstly classical tree hierarchy as external views to be easily visited by users broadly used to the expand-collapse method associated to this pattern.
    Then, for external specification purpose, let us consider only a topics/posts hierarchy per publisher with the same structure and methods as a directories/files hierarchy where files could have aliases to belong to several sub-directories :

    Now our “selective” publisher should be helped for creating views on his or her topics/posts hierarchy, assigning views to groups and then, by union of views build views of groups and views of readers.
    We will answer to the publisher needs by a unique structure type for all views: a “tree cut” or “shear” which will define our third, and last, class of objects.
Let us define a “shear view” practically:
a shear view of a topic tree

Propositions:

  1. union of shears is a shear - proof element: there is always a highest-deepest visible topic on every branch;
  2. because it does not exist constraint on assignement of views to (sub-)groups, it could exist some sub-group with a shear view not lower than (i.e. not including) some view of some parent super-group - proof: assign any branch shear view to the top group "everyone" and then assign the single topic "Top" for view of a sub-group of "everyone" - but...
  3. the shear view of a sub-group user is lower than (i.e. includes) the shear view of any user of  parents super-groups - proof element: see above the method "insert a reader in a group" !

3. Demonstration prototypes

4. Conclusion

    We think our "iceberg visibility model", with classes of objects defined here above, could be implemented at the database schema level - in an object model or a relational model with integrity constraints - and thus could free the application level from checking the coherency of the publisher policy toward his or her readers. Furthermore it could help - by "try and check" -  the publisher to put each of his or her reader in the group appropriate to the view he or she wants  to give  to him or her.  Defining (1) group hierarchy as a subset of the power set of registred users, (2)  visible topics as all topics above a choosen topic within every branch of the topic tree hierarchy and (3) "shear" view of  any user as the union of views all the groups he or she belongs to have been assigned, are the basic - necessary and sufficient - requirements of our model.

firstly edited on 2005/09/26