taverna-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Stian Soiland-Reyes (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TAVERNA-1037) GSOC: Use arcp:// URIs in RO Bundle
Date Wed, 21 Feb 2018 11:27:00 GMT

    [ https://issues.apache.org/jira/browse/TAVERNA-1037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16371261#comment-16371261

Stian Soiland-Reyes commented on TAVERNA-1037:

Hi, [~Gonidelis]! Thank you for your interest in GSOC and Apache Taverna.

According to the [GSOC timeline|https://developers.google.com/open-source/gsoc/timeline] there
is about a month for students to work with communities, this is to help you flesh out the
GSOC idea (say as a Google Doc?) which you will have to submit in middle March, as well as
of course for us to get to know you a bit closer.

For that you may also want to subscribe to the [dev@taverna|https://taverna.incubator.apache.org/community/lists.html#devtaverna]
mailing list and introduce yourself.

Before GSOC proposal time it's good if you have a play with the current code so you understand
roughly what you will be working with.  You can read up on these topics:

* [Java 8 Path API|https://docs.oracle.com/javase/8/docs/api/java/nio/file/Path.html]
* [ro-bundlle tutorial|https://github.com/apache/incubator-taverna-language/tree/master/taverna-robundle]
* [Introduction to RDF with Commons RDF|https://commons.apache.org/proper/commons-rdf/introduction.html]
* [What is Taverna|https://taverna.incubator.apache.org/introduction/]

The RO bundle is a part of Taverna used for storing workflows and workflow run results as
a [Research Object|http://www.researchobject.org/] -- you may not need to go too much into
detail of this yet.

You probably can just start making some example code playing with it, e.g. just add/remove
some dummy files, printing out path.toURI() etc.  You can use [Eclipse|http://www.eclipse.org/downloads/]
or your favourite IDE for Java.

BTW, you can download some Research Objects to play with (made using taverna-robundle) from
https://view.commonwl.org/workflows and clicking "Download as Research Object Bundle" on some

> GSOC: Use arcp:// URIs in RO Bundle
> -----------------------------------
>                 Key: TAVERNA-1037
>                 URL: https://issues.apache.org/jira/browse/TAVERNA-1037
>             Project: Apache Taverna
>          Issue Type: New Feature
>          Components: Taverna Language
>         Environment: Java, HTTP
>            Reporter: Stian Soiland-Reyes
>            Priority: Major
>              Labels: RFC, gsoc2018, java, uri
> This is a project idea for [Google Summer of Code|https://summerofcode.withgoogle.com/] (GSOC).
To discuss this or other ideas with your potential mentor from the Apache Taverna project,
sign up and post to the [dev@taverna|https://taverna.incubator.apache.org/community/lists.html#devtaverna] list,
including "[GSOC]" in the subject. You may also comment on this Jira issue if you have created
an account.
> --
> The [ro-bundle|https://github.com/apache/incubator-taverna-language/tree/master/taverna-robundle]
module of Taverna Language currently uses app:// URIs in its Java [FileSystem|https://docs.oracle.com/javase/8/docs/api/java/nio/file/FileSystem.html]
URIs and thus also in its RDF loading.
> This was in accordance with [RO bundle spec|https://researchobject.github.io/specifications/bundle/#absolute-uris]
– however the app:// URI scheme has been since abandoned.
> The arcp URI scheme has been proposed as an alternative to describe paths within an archive
(e.g. ZIP file): [https://tools.ietf.org/id/draft-soilandreyes-arcp-03.html] (This Internet-Draft
is progressing towards an RFC)
> This proposal suggests to modify RO Bundle to use arcp:// URIs – but not just modify
app://bf5a0cab-86d7-40da-b588-1ce4953ae13d/ to arcp://uuid,bf5a0cab-86d7-40da-b588-1ce4953ae13d/
- but to support the other mechanisms suggested by arcp in [https://tools.ietf.org/id/draft-soilandreyes-arcp-03.html#rfc.section.4.1]
> That is, it should be possible to open an RO Bundle from a fixed URL as identifier, or
using its sha256 checksum in readonly mode.
> It should also be possible to lookup an RO Bundle URI from a .well-known endpoint as
defined in [https://tools.ietf.org/id/draft-soilandreyes-arcp-03.html#rfc.section.4.4]
> Extensions to this project could be to add a Java URL handler so that URLs from an opened
RO Bundle file system also can be used as java.net.URLs. Also it could develop a new arcp-java
module similar to the reference Python implementation https://pypi.python.org/pypi/arcp
> Prospective students are expected to participate in the Apache [Taverna community|https://taverna.incubator.apache.org/community/lists.html#devtaverna]
- but are also welcome to join the [IETF |https://www.ietf.org]review process  in reviewing
or improving the arcp Internet-Draft to progress it towards RFC.
> Suggested mentor: Stian Soiland-Reyes

This message was sent by Atlassian JIRA

View raw message