Return-Path: X-Original-To: apmail-airavata-dev-archive@www.apache.org Delivered-To: apmail-airavata-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3EA74FF32 for ; Fri, 19 Apr 2013 09:09:57 +0000 (UTC) Received: (qmail 32435 invoked by uid 500); 19 Apr 2013 09:09:56 -0000 Delivered-To: apmail-airavata-dev-archive@airavata.apache.org Received: (qmail 32295 invoked by uid 500); 19 Apr 2013 09:09:55 -0000 Mailing-List: contact dev-help@airavata.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@airavata.apache.org Delivered-To: mailing list dev@airavata.apache.org Received: (qmail 32241 invoked by uid 99); 19 Apr 2013 09:09:53 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 19 Apr 2013 09:09:53 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [188.165.246.31] (HELO cosmos2.geomatys.com) (188.165.246.31) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 19 Apr 2013 09:09:47 +0000 Received: from kuroshio.local (unknown [193.48.189.43]) by cosmos2.geomatys.com (Postfix) with ESMTP id 5F47D1891B; Fri, 19 Apr 2013 11:28:01 +0200 (CEST) Message-ID: <517109BC.10409@geomatys.fr> Date: Fri, 19 Apr 2013 11:09:16 +0200 From: Martin Desruisseaux Organization: Geomatys User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.7; rv:17.0) Gecko/20130328 Thunderbird/17.0.5 MIME-Version: 1.0 To: AMILA RANATUNGA CC: "dev@airavata.apache.org" , "dev@sis.apache.org" , Harsha Kumara , Shahani Markus Weerawarana , "dev@oodt.apache.org" Subject: Re: Research project on integrating geoservices with Apache Airavata References: In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 8bit X-Virus-Checked: Checked by ClamAV on apache.org Hello Amila Le 18/04/13 20:47, AMILA RANATUNGA a �crit : > Thank you very much for the replies so far here and in SIS dev list as > well. They were really helpful. We are currently writing a research > paper regarding what kind of model that should be used by when > building a geoscience gateway. We intend to discuss issues that a > geoscientist face during his/her research and features that should be > inserted to such a gateway to overcome them. Then we can consider them > during our main project as well. It is appreciated if you can point > any research papers or resources where we can see this domain from an > eye of a geoscientist will be really useful. Even a good case study > will be really helpful. There is a chapter in my Ph.D thesis that I wrote 10 years ago [1], but it is in French... However there is some points: Before to start coding open source software, I had a formation on ERDAS Imagine. At that time, the raster data could be either measurement (e.g. altitude in metres), or categories (land, forest, lake, etc.). However for my work in oceanography I needed a mix of both in the same raster: Sea Surface Temperature (SST) measurement, together with some NaN (Not-a-Number) values indicating that the pixel was a cloud, or a land, etc. The software 10 years ago was not allowing that. My study was correlating data from remote sensing image, with fisheries data. From OGC perspective, this is equivalent to getting WCS and WebSensor to work together. Raster and sensor are two very different kind of data, and doing some work of the kind "I want all temperature data at the location and time of each sensor data, and also all temperature data 10 days before the time of each sensor data" was needed. On the remote images side, my study was using 4 different kind of data: Sea Surface Temperature (SST), chlorophyll-a concentration, Sea Level Anomaly (SLA) and Ekman pumping. Each kind of data have very different characteristics in term spatial and temporal coverage, resolution and format. Handling such heterogeneous source of data was a challenge. Indeed, in my review of previous work, I saw many study correlating fish populations with temperature, or correlating fish population with chlorophyll, but I found no study correlating fish population to many parameters taken together (e.g. some condition of temperature in same time than some concentration of chlorophyll-a). Doing such combined study has been a big development effort. However it was 10 years ago, I'm sure the situation is different now. For each time and location of a sensor data, I needed to interpolate the temperature, chlorophyll-a, etc. measurement from the raster data, at the sensor time, 5 days before, 10 days before, etc., compute on the fly some derivative quantities like gradient of temperature (i.e. apply the Sobel operator on rasters of SST data) again 0 day, 5 days, 10 days, etc. before, handle the case of missing (NaN) values (e.g. if got a NaN when interpolating a value using the bi-cubic interpolation, try again with the bi-linear interpolation since it uses less data and thus reduce the risk of getting NaN). So having a software doing the work automatically was crucial. An other way to explain the above paragraph would be to said that for each sensor, we create many (potentially hundred) "virtual sensors" derived from remote sensing data. For example if you had a sensor measuring temperature inside your car, it is like attaching "virtual sensors" to the real sensor, where the virtual sensors behave like any real sensor but using the data from remote sensing images. Of course we have to take in account that the car is moving, to pixel requested on the remote sensing images is always changing. The amount of data produced in the above step was huge. Some statistical tools was needed for evaluating the coefficient of correlation between the above "virtual sensors" and the real ones, so we can trim the "virtual sensors" that do not seem relevant to our study. Again, because of the amount of data, automation is key. Not sure if it is of any help... Martin [1] http://horizon.documentation.ird.fr/exl-doc/pleins_textes/divers09-08/010035115.pdf