Serendipity

Applying systems thinking to computing, climate and sustainability

  • About Steve
  • Popular Posts
  • Sitemap (blog contents)

Data sharing and boundary objects

24. April 2009 · 1 comment · Categories: climate science, data modeling, EGU 2009 · Tags: collaboration, EGU, ideas, informatics, interdisciplinarity, metadata, models, research, tools

Okay, slow day today, as I took some time out to get my talk ready for this afternoon. In the meantime, it gives me a chance to reflect on a few ideas. For example, I’ve seen many talks this week on data sharing, and had some very juicy discussion over dinner last night with Bryan. Two key challenges seem to stand out in this work: (1) the lack of shared ontology between scientific sub-communities who want to share datasets, and (2) the inevitable separation of data from commentaries on that data (which includes problems of knowledge provenance, and meta-data management). The latter problem is endemic because the data sources and the commentary sources are in different communities, operate asynchronously, and often have very different goals.

There seem to be plenty of people considering the technical aspects of these problems, exploring the use of ontology description languages (e.g. OWL and its relatives), and markup languages (e.g. the many application schemas of GML). But very little emphasis has been placed on the human side of this problem. First, the sociological problem of ontological drift has been ignored – the problem that objects gradually change their meaning as they pass between difference communities. (Chalmers gives an overview of various responses to this problem).

A key problem here is that scientific communities have collaborated reasonably effectively in previous centuries through the use of boundary objects (first characterized by Leigh Star). Boundary Objects are…

…  both plastic enough to adapt to local needs and constraints of the several parties employing them, yet robust enough to maintain a common identity across sites. They are weakly structured in common use, and become strongly structured in individual-site use. They may be abstract or concrete. They have different meanings in different social worlds but their structure is common enough to more than one world to make them recognizable means of translation. [Wikipedia defn]

Examples include taxonomies, maps, scientific methods, etc. I talked a little with Leigh about boundary objects in the early 90′s, and I observed that boundary objects are very effective because they represent a minimal shared understanding – just enough so that different communities have some common frames of reference, but no so much than anyone has to work hard on ensuring they have the same mental models.

The problem is that boundary objects fool us into thinking that we share much more meaning than we really do. When we try and embed our boundary objects into computer systems (thinking that we have a shared semantics), we bind the boundary objects to a particular interpretation, and hence they lose their plasticity. As soon as we do this, they are no longer boundary objects. They are brittle reflections of the real boundary objects. Whereas before, the objects themselves could be adapted to each communities needs, now the communities themselves must do the adapting, to the fixed definitions of these frozen objects. No wonder everyone finds this hard!

So what do we do? I don’t really know, but I have some ideas. Lots of small local ontologies, with loose flexible mappings, between them created on the fly by the communities that use them, social tagging style? Or heavier weight tools from psychology for teasing out mappings between terminologies and concepts used by different expert communities, such as repertory grids (e.g. Shaw and Gaines developed a technique applying Rep Grids for exploring conflicting terminology). Or perhaps more flexible ontology languages built over paraconsistent logics?  I’ve played with all these ideas in the past, and think they all have some value. How to exploit them in practical data sharing systems remains a big open question.

Share this:

  • Pinterest
  • Facebook
  • Twitter
  • Google +1
  • Reddit
  • Digg
  • StumbleUpon
  • LinkedIn
  • Tumblr
  • Print
  • Email

1 Comment

  1. Bryan Lawrence
    April 24, 2009 at 5:24 pm

    Thanks mate! Good stuff to get us going …

  2. Pingback: Open Climate Science or Denial of Service attacks? | Serendipity

  3. Pingback: A sad farewell to Susan Leigh Star | Serendipity

  4. Pingback: Open Questions about Open Source for Open Science | Serendipity

Fresh Serendipity:

  • Simple Climate Models to play with in the classroom
  • Notes for a high school student
  • What I do (in simple words)
  • New Grad Course on Computational Models of Climate Change
  • Successful Predictions (AGU 2012 Tyndall Lecture)
  • How the Jarvis bike lanes became collateral damage in a fictitious war
  • The History of Climate Models
  • Notes from Hurricane Sandy
  • Andrew Wisdom – Cities as Systems of Systems
  • Jeb Brugmann – The Productive City
  • My first year climate course
  • What is Climate Informatics?
  • Don’t feed the trolls
  • Nobody invented the internet!
  • What is Science Literacy?

Recent Tweets

  • RT @arthursmith: RT @seanmcarroll Sitting at #4 on amazon: a physics book full of equations. Take that, conventional wisdom. http://t.co ... 4 hours ago
  • Journalists have to find stuff to write about each time a deadline rolls around; science bloggers can let non-stories stay as non-stories 7 hours ago
  • RT @ClimateOfGavin: Climate sensitivity nostalgia: what I said in 2006: realclimate.org/index.php/arch… 'Plus ça change' indeed... 8 hours ago
  • RT @dana1981: @mat_collins @SMEasterbrook Those uncertanties seem major challenges for research using recent data to estimate ECS (i.e. ... 8 hours ago
  • RT @mat_collins: @SMEasterbrook Probably our point is that we don't know if it is a non-linearity in the feedback, the forcing or the oc ... 10 hours ago
Follow @SMEasterbrook

Archives

  • 2013 (4)
    • January (4)
  • 2012 (30)
    • January (3)
    • February (3)
    • March (2)
    • April (5)
    • May (7)
    • June (1)
    • August (1)
    • September (1)
    • October (3)
    • November (3)
    • December (1)
  • 2011 (87)
    • January (12)
    • February (11)
    • March (12)
    • April (5)
    • May (6)
    • June (4)
    • July (7)
    • August (4)
    • September (6)
    • October (11)
    • November (3)
    • December (6)
  • 2010 (109)
    • January (9)
    • February (14)
    • March (17)
    • April (8)
    • May (6)
    • June (6)
    • July (7)
    • August (11)
    • September (8)
    • October (1)
    • November (14)
    • December (8)
  • 2009 (128)
    • March (27)
    • April (20)
    • May (17)
    • June (9)
    • July (11)
    • August (5)
    • September (6)
    • October (9)
    • November (11)
    • December (13)

Topics

Blogroll

  • A Few Things Ill-Considered
  • All Models Are Wrong
  • Alternative Energy Action Network
  • Azimuth
  • Capital Climate
  • Climate Charts and Graphs
  • Climate Denial Crock of the Week
  • Climate Lab Book
  • ClimateProgress
  • Crock of the Week
  • Deltoid
  • DeSmogBlog
  • Global Warming Art
  • Greenfyre's
  • Gristmill
  • http://www.ecoequity.org/
  • James Empty Blog
  • Left as an Exercise
  • Mind of Dan
  • Monbiot.com
  • More Grumbine Science
  • Only in it for the Gold
  • People and Place
  • RealClimate
  • The Cost of Energy
  • The Intersection
  • The Way Things Break
  • Yale Climate Forum

Meta

  • Register
  • Log in
  • Entries RSS
  • Comments RSS
  • WordPress.org
© 2013 Serendipity. All rights reserved.
Design by picomol. Powered by WordPress.
loading Cancel
Post was not sent - check your email addresses!
Email check failed, please try again
Sorry, your blog cannot share posts by email.