<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>

<channel>
	<title>Adaptive Information Disclosure (AID)</title>
	<atom:link href="http://adaptivedisclosure.org/feed/" rel="self" type="application/rss+xml" />
	<link>http://adaptivedisclosure.org</link>
	<description>Participating in the VL-e project</description>
	<pubDate>Mon, 08 Aug 2011 12:51:30 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>CWA prototype - Calpain 3 (continued)</title>
		<link>http://adaptivedisclosure.org/cwa-prototype-calpain-3-continued-2/</link>
		<comments>http://adaptivedisclosure.org/cwa-prototype-calpain-3-continued-2/#comments</comments>
		<pubDate>Tue, 05 May 2009 20:45:43 +0000</pubDate>
		<dc:creator>Marco</dc:creator>
		
		<category><![CDATA[CWA-NL-prototype]]></category>

		<category><![CDATA[CWA]]></category>

		<guid isPermaLink="false">http://wongiseng.homelinux.org/?p=201</guid>
		<description><![CDATA[Update from latest telcon by Eric:

The knowledge base triples do not contain human readable labels (yet); these could come from UMLS, but the UMLS upload was not yet successful (it contains a limited set of triples.
Another approach may be to try to create REST calls to public databases (i.e. URIs).
The cwa repository should contain lovd [...]]]></description>
			<content:encoded><![CDATA[<p>Update from latest telcon by Eric:</p>
<ul>
<li>The knowledge base triples do not contain human readable labels (yet); these could come from UMLS, but the UMLS upload was not yet successful (it contains a limited set of triples.</li>
<li>Another approach may be to try to create REST calls to public databases (i.e. URIs).</li>
<li>The cwa repository should contain lovd data now.</li>
</ul>
<p>I will try to reach LOVD data, and perhaps some other data to link out from.</p>
<p>First I had to figure out how to do regex on parts of URIs; there is a string representation function; this works:<br />
<code>PREFIX cwa: &lt;http://www.nbic.nl/cwa#&gt;<br />
PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;<br />
SELECT ?cwa_concept ?relation ?cwa_concept2<br />
WHERE {<br />
?cwa_concept ?relation ?cwa_concept2.<br />
FILTER (regex(str(?cwa_concept),&#8221;C3000052&#8243;,&#8221;i&#8221;))<br />
}</code></p>
<p>This filter<br />FILTER (regex(str(?cwa_concept),&#8221;lovd&#8221;,&#8221;i&#8221;) || regex(str(?relation),&#8221;lovd&#8221;,&#8221;i&#8221;) || regex(str(?cwa_concept2),&#8221;lovd&#8221;,&#8221;i&#8221;))<br />in the query above did not give any results; I&#8217;ve asked Erik what I should do to get something from lovd.</p>
<p></p>
]]></content:encoded>
			<wfw:commentRss>http://adaptivedisclosure.org/cwa-prototype-calpain-3-continued-2/feed/</wfw:commentRss>
		</item>
		<item>
		<title>CWA prototype - Calpain 3 (continued)</title>
		<link>http://adaptivedisclosure.org/cwa-prototype-calpain-3-continued/</link>
		<comments>http://adaptivedisclosure.org/cwa-prototype-calpain-3-continued/#comments</comments>
		<pubDate>Tue, 05 May 2009 08:49:11 +0000</pubDate>
		<dc:creator>Marco</dc:creator>
		
		<category><![CDATA[FI-AID]]></category>

		<guid isPermaLink="false">http://wongiseng.homelinux.org/?p=194</guid>
		<description><![CDATA[I have difficulty finding a path to more information about Calpain-3 via the umls identifier C3000052. This query
PREFIX umls: &#60;http://umls.nlm.nih.gov/&#62;
SELECT ?umls_concept ?umls_relation ?umls_concept2
WHERE {
?umls_concept ?umls_relation ?umls_concept2.
FILTER (?umls_concept = umls:C3000052)
}
has no results, while filtering on umls:A8400986 does (used this id just as a test).
Because of this and that the response time for broad searches on umls [...]]]></description>
			<content:encoded><![CDATA[<p>I have difficulty finding a path to more information about Calpain-3 via the umls identifier C3000052. This query<code><br />
PREFIX umls: &lt;http://umls.nlm.nih.gov/&gt;<br />
SELECT ?umls_concept ?umls_relation ?umls_concept2<br />
WHERE {<br />
?umls_concept ?umls_relation ?umls_concept2.<br />
FILTER (?umls_concept = umls:C3000052)<br />
}</code><br />
has no results, while filtering on umls:A8400986 does (used this id just as a test).</p>
<p>Because of this and that the response time for broad searches on umls on the cloud server (e.g. with regex) is slow, I want to have a look at mesh. The mesh subset is also on our aida server. The mesh id for human Calpain-3 on the <a title="nlm mesh browser" href="http://www.nlm.nih.gov/mesh/2009/mesh_browser/MBrowser.html">nlm mesh website</a> appears to be <a title="nlm mesh page for human Calpain-3" href="http://www.nlm.nih.gov/cgi/mesh/2009/MB_cgi?mode=&amp;index=184919&amp;field=all&amp;HM=&amp;II=&amp;PA=&amp;form=&amp;input=">C105884</a>.</p>
<p>I tried this query on the cloud umls repository:<code><br />
PREFIX umls: &lt;http://umls.nlm.nih.gov/&gt;<br />
SELECT ?umls_concept ?umls_relation ?umls_concept2<br />
WHERE {<br />
?umls_concept ?umls_relation ?umls_concept2.<br />
FILTER (?umls_concept = &lt;http://umls.nlm.nih.gov/MSH/MSH_C105884&gt; || ?umls_concept2 = &lt;http://umls.nlm.nih.gov/MSH/MSH_C105884&gt;)<br />
}</code><br />
gave 36 triples with interesting information, but not a name yet (which is what I am looking for). But it does provide a refseq and a link to a document. However, I have not found this via a link with C3000052 of course.</p>
<h3>Suggestion</h3>
<p>All in all, I have stumbled upon the well known bioinformatics problem of identifiers, which leads me to propose a suggestion for CWA. Although I appreciate any attempt to find unambiguous &#8217;standard&#8217; identifiers, I think experience tells us that most attempts have just added a new types of identifiers. Therefore, I suggest that we focus on reducing missing links (which seems to be my problem here). This will not be a perfect solution, but perhaps a good one <img src='http://adaptivedisclosure.org/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> I imagine we facilitate a community approach, based on these principles:</p>
<ol>
<li>any link to any of the many &#8217;standard&#8217; identifiers links in with a network of interlinked identifiers such that we can reach the identifier we need</li>
<li>any link created by a user adds to the network, thereby reducing missing links (hence, the community is making the network)</li>
<li>tools can be built on this network:
<ol>
<li>e.g. a path traverser for when the required information is a number of triples away along a path. (e.g I was looking for a human readable label); gradually we could build in intelligence, such as SKOS label discovery.</li>
<li>A web service for identifier discovery based on this would be really useful in workflows. Such a service would hide the RDF details from the queries (you could just say: &#8216;umls&#8217;, or &#8216;mesh&#8217; or &#8216;refseq&#8217; for the output type; I imagine the traversed path is a second output for validation)</li>
<li>Would this be a connection with Okkam (Staphano Bocconi)? I think this is doable, useful for users, and enough for a computer scientist to produce papers on (imho I don&#8217;t think Okkam can produce something useful on both sides unless a community is enabled to do most of the work).</li>
</ol>
</li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://adaptivedisclosure.org/cwa-prototype-calpain-3-continued/feed/</wfw:commentRss>
		</item>
		<item>
		<title>SeRQL to SPARQL for CWA query</title>
		<link>http://adaptivedisclosure.org/serql-to-sparql-for-cwa-query/</link>
		<comments>http://adaptivedisclosure.org/serql-to-sparql-for-cwa-query/#comments</comments>
		<pubDate>Mon, 04 May 2009 12:24:38 +0000</pubDate>
		<dc:creator>Marco</dc:creator>
		
		<category><![CDATA[Queries]]></category>

		<category><![CDATA[CWA]]></category>

		<category><![CDATA[CWA-NL-prototype]]></category>

		<category><![CDATA[RDF query]]></category>

		<guid isPermaLink="false">http://wongiseng.homelinux.org/?p=190</guid>
		<description><![CDATA[I tried to convert a serql query to a sparql query, but I could not find a proper equivalent of the Localname (and Namespace) functions in SeRQL, or documentation/examples of how to do this with FILTER functionsÂ  were too difficult to find. The solution chosen for SPARQL seems more powerful and extensible, but it is [...]]]></description>
			<content:encoded><![CDATA[<p>I tried to convert a serql query to a sparql query, but I could not find a proper equivalent of the Localname (and Namespace) functions in SeRQL, or documentation/examples of how to do this with FILTER functionsÂ  were too difficult to find. The solution chosen for SPARQL seems more powerful and extensible, but it is definitely more complicated and could use more examples (or easier to find with Google).</p>
<p>SeRQL:</p>
<p><code>SELECT DISTINCT cwa_concept, relation, cwa_concept2<br />
FROM<br />
{cwa_concept} relation {cwa_concept2}<br />
WHERE<br />
Localname(cwa_concept)= "C3000052"<br />
USING NAMESPACE<br />
cwa = &lt;http://www.nbic.nl/cwa#&gt;,<br />
rdf = &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;<br />
</code><br />
My SPARQL &#8216;almost&#8217; equivalent:<br />
<code><br />
PREFIX cwa: &lt;http://www.nbic.nl/cwa#&gt;<br />
PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;<br />
SELECT ?cwa_concept ?relation ?cwa_concept2<br />
WHERE {<br />
?cwa_concept ?relation ?cwa_concept2.<br />
FILTER (?cwa_concept = &lt;http://www.nbic.nl/cwa/umls/C3000052&gt;)<br />
}<br />
</code></p>
]]></content:encoded>
			<wfw:commentRss>http://adaptivedisclosure.org/serql-to-sparql-for-cwa-query/feed/</wfw:commentRss>
		</item>
		<item>
		<title>CWA prototype - Calpain 3</title>
		<link>http://adaptivedisclosure.org/cwa-prototype-calpain-3/</link>
		<comments>http://adaptivedisclosure.org/cwa-prototype-calpain-3/#comments</comments>
		<pubDate>Sun, 03 May 2009 15:32:27 +0000</pubDate>
		<dc:creator>Marco</dc:creator>
		
		<category><![CDATA[CWA-NL-prototype]]></category>

		<category><![CDATA[Calpain]]></category>

		<category><![CDATA[CAPN3]]></category>

		<guid isPermaLink="false">http://wongiseng.homelinux.org/?p=170</guid>
		<description><![CDATA[In little steps I am working towards questions such as: 'Which gene mutations are associated with CAPN3 (1), and which diseases are associated with those mutations (2) and who reported these findings (3). What proteins are associated with those diseases that are also related to epigenetic mechanisms (4), and who is the biology expert to talk about the results (5). How has the data of this month changed compared to that of previous months (5).]]></description>
			<content:encoded><![CDATA[<p>Uniprot contains several human versions (disregarding entries described as fragments or &#8216;highly similar to&#8217;)</p>
<p><a href="http://www.uniprot.org/uniprot/P20807">P20807</a><br />
<a href="http://www.uniprot.org/uniprot/Q9BQC8">Q9BQC8</a><br />
<a href="http://www.uniprot.org/uniprot/Q496N2">Q496N2</a><br />
<a href="http://www.uniprot.org/uniprot/Q762C8">Q762C8</a></p>
<p>I will concentrate on <a href="http://www.uniprot.org/uniprot/Q9BQC8">http://www.uniprot.org/uniprot/P20807</a></p>
<p>However, Erik&#8217;s CWA servlet (http://aneurist.erasmusmc.nl/CWA/servlet/pop) tells me the UMLS identifier is <small>umls/C3000052</small>. I think that is used as the identifier in the CWA knowledge base, so I will continue with that.</p>
<p>The query</p>
<pre>SELECT DISTINCT cwa_concept, relation, cwa_concept2
FROM
{cwa_concept} relation {cwa_concept2}
WHERE
Localname(cwa_concept)= "C3000052"
USING NAMESPACE
cwa = &lt;http://www.nbic.nl/cwa#&gt;,
rdf = &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;</pre>
<p>gives all relations of the UMLS term identifyer. &#8216;relation&#8217; appears to be one of cwa:related, cwa:relation, rdf:instanceOf; there appears to be no further information (e.g. human readable labels) in this repository.</p>
<p>The SPARQL &#8216;amost&#8217; equivalent (couldn&#8217;t find &#8216;Localname()&#8217; equivalent: see <a title="SeRQL -&gt; SPARQL conversion problem" href="http://wongiseng.homelinux.org/wp-admin/post.php?action=edit&amp;post=190">this post</a>):<br />
<code><br />
PREFIX cwa: &lt;http://www.nbic.nl/cwa#&gt;<br />
PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;<br />
SELECT ?cwa_concept ?relation ?cwa_concept2<br />
WHERE {<br />
?cwa_concept ?relation ?cwa_concept2.<br />
FILTER (?cwa_concept = &lt;http://www.nbic.nl/cwa/umls/C3000052&gt;)<br />
}<br />
</code><br />
[CHECK UMLS repository]</p>
]]></content:encoded>
			<wfw:commentRss>http://adaptivedisclosure.org/cwa-prototype-calpain-3/feed/</wfw:commentRss>
		</item>
		<item>
		<title>CWA prototype triples exploration</title>
		<link>http://adaptivedisclosure.org/cwa-prototype-triples-exploration/</link>
		<comments>http://adaptivedisclosure.org/cwa-prototype-triples-exploration/#comments</comments>
		<pubDate>Sun, 03 May 2009 15:30:53 +0000</pubDate>
		<dc:creator>Marco</dc:creator>
		
		<category><![CDATA[CWA-NL-prototype]]></category>

		<category><![CDATA[CWA]]></category>

		<category><![CDATA[RDF query]]></category>

		<guid isPermaLink="false">http://wongiseng.homelinux.org/?p=169</guid>
		<description><![CDATA[Exploring one relation:
http://ec2.homelinux.org:8080/openrdf-workbench/repositories/cwa/explore?resource=%3Chttp%3A%2F%2Fwww.nbic.nl%2Fcwa%2Frelation%2FC0033684%23C3041379%231240640890649%3E
Annotation:

http://www.uniprot.org/uniprot/a16_anoga links trough to Uniprot Protein A16 precursor (http://www.uniprot.org/uniprot/Q93118)


relation concepts are annotated with a uniprot reference (cwa:annotation)
This: &#60;http://www.nbic.nl/cwa/relation/C0033684#C3041379#1240640890649&#62; is a cooccurence relation (cwa:typeRelation cwa:cooccurrence)

It is related to (cwa:relation) &#60;http://www.nbic.nl/cwa/relation/C0033684#C3041379&#62;
I can&#8217;t make out the type of &#60;http://www.nbic.nl/cwa/relation/C0033684#C3041379&#62;
&#60;http://www.nbic.nl/cwa/relation/C0033684#C3041379&#62; cwa:relation &#60;http://www.nbic.nl/cwa/relation/C0033684#C3041379#1240640890649&#62;


cwa:has_query shows the user query (I think)
cwa:related binds relation terms {concept1} cwa:related {concept2}
How do I go [...]]]></description>
			<content:encoded><![CDATA[<p>Exploring one relation:</p>
<p><a title="example relation" href="http://ec2.homelinux.org:8080/openrdf-workbench/repositories/cwa/explore?resource=%3Chttp%3A%2F%2Fwww.nbic.nl%2Fcwa%2Frelation%2FC0033684%23C3041379%231240640890649%3E" target="_blank">http://ec2.homelinux.org:8080/openrdf-workbench/repositories/cwa/explore?resource=%3Chttp%3A%2F%2Fwww.nbic.nl%2Fcwa%2Frelation%2FC0033684%23C3041379%231240640890649%3E</a></p>
<p>Annotation:</p>
<ul>
<li><a href="http://www.uniprot.org/uniprot/a16_anoga">http://www.uniprot.org/uniprot/a16_anoga</a> links trough to Uniprot Protein A16 precursor (<a href="http://www.uniprot.org/uniprot/Q93118">http://www.uniprot.org/uniprot/Q93118</a>)</li>
</ul>
<ul>
<li>relation concepts are annotated with a uniprot reference (cwa:annotation)</li>
<li>This: &lt;http://www.nbic.nl/cwa/relation/C0033684#C3041379#1240640890649&gt; is a cooccurence relation (cwa:typeRelation cwa:cooccurrence)
<ul>
<li>It is related to (cwa:relation) &lt;http://www.nbic.nl/cwa/relation/C0033684#C3041379&gt;</li>
<li>I can&#8217;t make out the type of &lt;http://www.nbic.nl/cwa/relation/C0033684#C3041379&gt;</li>
<li>&lt;http://www.nbic.nl/cwa/relation/C0033684#C3041379&gt; cwa:relation &lt;http://www.nbic.nl/cwa/relation/C0033684#C3041379#1240640890649&gt;</li>
</ul>
</li>
<li>cwa:has_query shows the user query (I think)</li>
<li>cwa:related binds relation terms {concept1} cwa:related {concept2}</li>
<li><em>How do I go from a relation term to a cooccurence?</em></li>
<li><em><br />
</em></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://adaptivedisclosure.org/cwa-prototype-triples-exploration/feed/</wfw:commentRss>
		</item>
		<item>
		<title>CWA prototype class and relation types</title>
		<link>http://adaptivedisclosure.org/cwa-prototype-class-and-relation-types/</link>
		<comments>http://adaptivedisclosure.org/cwa-prototype-class-and-relation-types/#comments</comments>
		<pubDate>Wed, 29 Apr 2009 11:10:22 +0000</pubDate>
		<dc:creator>Marco</dc:creator>
		
		<category><![CDATA[Queries]]></category>

		<category><![CDATA[CWA]]></category>

		<category><![CDATA[RDF query]]></category>

		<guid isPermaLink="false">http://wongiseng.homelinux.org/?p=163</guid>
		<description><![CDATA[CWA classes
SELECT DISTINCT cwa_class
FROM
{cwa_instance} rdf:instanceOf {cwa_class}
USING NAMESPACE
cwa = &#60;http://www.nbic.nl/cwa#&#62;,
rdf = &#60;http://www.w3.org/1999/02/22-rdf-syntax-ns#&#62;
cwa_class:
   cwa:RelationTerm
CWA relations
SELECT DISTINCT cwa_relation
FROM
{cwa_concept1} cwa_relation {cwa_concept2}
WHERE
Namespace(cwa_relation)=cwa:
USING NAMESPACE
cwa = &#60;http://www.nbic.nl/cwa#&#62;,
rdf = &#60;http://www.w3.org/1999/02/22-rdf-syntax-ns#&#62;
Results in:
cwa_relation:
Â Â Â Â Â Â Â  cwa:related
Â Â Â Â Â Â Â  cwa:relation
Â Â Â Â Â Â Â  cwa:typeRelation
Â Â Â Â Â Â Â  cwa:strength
Â Â Â Â Â Â Â  cwa:has_queryÂ Â 
Â Â Â Â Â Â Â  cwa:discovered_by
Â Â Â Â Â Â Â  cwa:timestamp
Â Â Â Â Â Â Â  cwa:annotation
CWA relation types
SELECT DISTINCT cwa_type
FROM
{cwa_concept} cwa:typeRelation {cwa_type}
USING NAMESPACE
cwa = &#60;http://www.nbic.nl/cwa#&#62;,
rdf = &#60;http://www.w3.org/1999/02/22-rdf-syntax-ns#&#62;
cwa_type
        cwa:cooccurrence
]]></description>
			<content:encoded><![CDATA[<h3>CWA classes</h3>
<pre>SELECT DISTINCT cwa_class
FROM
{cwa_instance} rdf:instanceOf {cwa_class}
USING NAMESPACE
cwa = &lt;http://www.nbic.nl/cwa#&gt;,
rdf = &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;</pre>
<p>cwa_class:</p>
<pre>   cwa:RelationTerm</pre>
<h3>CWA relations</h3>
<pre>SELECT DISTINCT cwa_relation
FROM
{cwa_concept1} cwa_relation {cwa_concept2}
WHERE
Namespace(cwa_relation)=cwa:
USING NAMESPACE
cwa = &lt;http://www.nbic.nl/cwa#&gt;,
rdf = &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;</pre>
<p>Results in:</p>
<pre>cwa_relation:
Â Â Â Â Â Â Â  cwa:related
Â Â Â Â Â Â Â  cwa:relation
Â Â Â Â Â Â Â  cwa:typeRelation
Â Â Â Â Â Â Â  cwa:strength
Â Â Â Â Â Â Â  cwa:has_queryÂ Â 
Â Â Â Â Â Â Â  cwa:discovered_by
Â Â Â Â Â Â Â  cwa:timestamp
Â Â Â Â Â Â Â  cwa:annotation</pre>
<h3>CWA relation types</h3>
<pre>SELECT DISTINCT cwa_type
FROM
{cwa_concept} cwa:typeRelation {cwa_type}
USING NAMESPACE
cwa = &lt;http://www.nbic.nl/cwa#&gt;,
rdf = &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;</pre>
<p>cwa_type</p>
<pre>        cwa:cooccurrence</pre>
]]></content:encoded>
			<wfw:commentRss>http://adaptivedisclosure.org/cwa-prototype-class-and-relation-types/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Protected: Apr 10 Meeting Minutes</title>
		<link>http://adaptivedisclosure.org/minutesapr10/</link>
		<comments>http://adaptivedisclosure.org/minutesapr10/#comments</comments>
		<pubDate>Mon, 28 Apr 2008 11:13:12 +0000</pubDate>
		<dc:creator>AID</dc:creator>
		
		<category><![CDATA[FI-AID]]></category>

		<category><![CDATA[minutes]]></category>

		<guid isPermaLink="false">http://wongiseng.homelinux.org/minutesapr10</guid>
		<description><![CDATA[There is no excerpt because this is a protected post.]]></description>
			<content:encoded><![CDATA[<form action="http://adaptivedisclosure.org/wp-pass.php" method="post">
<p>This post is password protected. To view it please enter your password below:</p>
<p><label for="pwbox-60">Password:<br />
<input name="post_password" id="pwbox-60" type="password" size="20" /></label><br />
<input type="submit" name="Submit" value="Submit" /></p></form>
]]></content:encoded>
			<wfw:commentRss>http://adaptivedisclosure.org/minutesapr10/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Taverna workflow enactor op dev.wongiseng.homelinux.org</title>
		<link>http://adaptivedisclosure.org/taverna-workflow-enactor-op-aida9999/</link>
		<comments>http://adaptivedisclosure.org/taverna-workflow-enactor-op-aida9999/#comments</comments>
		<pubDate>Thu, 10 Apr 2008 15:33:02 +0000</pubDate>
		<dc:creator>Marco</dc:creator>
		
		<category><![CDATA[BioAID_dev]]></category>

		<guid isPermaLink="false">http://wongiseng.homelinux.org/taverna-workflow-enactor-op-aida9999</guid>
		<description><![CDATA[Tijdens mijn bezoek in Manchester hebben een aantal myGrid mensen op Scott en mijn verzoek een Taverna workflow enactor onder 9999 geinstalleerd (webapps/remotetaverna). We kunnen hiermee bv experimenteren met workflows laten draaien onder een eigen web pagina (via REST calls), maar ook om vanuit Taverna een workflow niet op je client maar op een server [...]]]></description>
			<content:encoded><![CDATA[<p>Tijdens mijn bezoek in Manchester hebben een aantal myGrid mensen op Scott en mijn verzoek een Taverna workflow enactor onder 9999 geinstalleerd (webapps/remotetaverna). We kunnen hiermee bv experimenteren met workflows laten draaien onder een eigen web pagina (via REST calls), maar ook om vanuit Taverna een workflow niet op je client maar op een server te draaien.</p>
<p>Een paar dingen:<br />
* Tomcat-dev is een paar keer herstart, mijn excuses als iemand daar last van heeft gehad. Het was ongepland.<br />
* Onder tomcat-dev/webapps zijn &#8216;remotetaverna.war&#8217; en remotetaverna erbij gekomen<br />
* In catalina.policy is een regel gezet om voor de database die bij remotetaverna hoort (Edgar, misschien wil je die bekijken).<br />
* Die database staat in /home/aida/remoteTavernaDB<br />
* Login is admin en wachtwoord is geheim, maar het liefst je eigen naam en wachtwoord aanmaken<br />
* De runner is op adres: http://dev.wongiseng.homelinux.org/remotetaverna/v1/<br />
* Op myExperiment heb ik alvast een &#8216;runner&#8217; aangemaakt: http://www.myexperiment.org/runners/5</p>
]]></content:encoded>
			<wfw:commentRss>http://adaptivedisclosure.org/taverna-workflow-enactor-op-aida9999/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Complications with mesh</title>
		<link>http://adaptivedisclosure.org/complications-with-mesh/</link>
		<comments>http://adaptivedisclosure.org/complications-with-mesh/#comments</comments>
		<pubDate>Sun, 30 Mar 2008 22:05:55 +0000</pubDate>
		<dc:creator>Marco</dc:creator>
		
		<category><![CDATA[BioAID_dev]]></category>

		<category><![CDATA[DocSearch]]></category>

		<guid isPermaLink="false">http://wongiseng.homelinux.org/complications-with-mesh</guid>
		<description><![CDATA[I would like to limit the search results for the following related reasons:

Protein names are not unique between organisms. They are closer to unique within an organism. Hence, if I could limit the search to papers e.g. in the human context, I would be more certain of the meaning of a protein name.
Martijn Scheumie&#8217;s uniprot [...]]]></description>
			<content:encoded><![CDATA[<p>I would like to limit the search results for the following related reasons:</p>
<ul>
<li>Protein names are not unique between organisms. They are closer to unique within an organism. Hence, if I could limit the search to papers e.g. in the human context, I would be more certain of the meaning of a protein name.</li>
<li>Martijn Scheumie&#8217;s uniprot service that I use for validation works on human uniprots only. Without limiting the documents to search, it is more likely that I will erroneously instantiate a protein with a human uniprot id, while it is e.g. a yeast protein.</li>
<li>More extracted proteins would be valid human proteins if I could preselect papers on organism.</li>
</ul>
<p>mesh does not seem a solution:</p>
<ul>
<li>mesh headings include &#8216;organism&#8217;, but in my very rough survey it seems this categorisation is not always used. Many papers returned by &#8216;EZH2&#8242; that are at least relevant to human cell biology are not tagged with &#8216;Humans&#8217; or any organism.</li>
</ul>
<p>adding &#8216;human&#8217; and synonyms to the search query (i.e. not only mesh) is not a solution</p>
<ul>
<li>abstracts do not necessarily contain a reference to the organism in which context it is written (experts can tell or it is not so important for expert readers).</li>
</ul>
<p>concept profiles could be useful</p>
<ul>
<li>A concept profile might be indicative of the organism the paper is about.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://adaptivedisclosure.org/complications-with-mesh/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Complications of extracting protein names</title>
		<link>http://adaptivedisclosure.org/complications-of-extracting-protein-names/</link>
		<comments>http://adaptivedisclosure.org/complications-of-extracting-protein-names/#comments</comments>
		<pubDate>Sun, 30 Mar 2008 08:02:58 +0000</pubDate>
		<dc:creator>Marco</dc:creator>
		
		<category><![CDATA[BioAID_dev]]></category>

		<category><![CDATA[NER]]></category>

		<category><![CDATA[protein]]></category>

		<category><![CDATA[uniprot]]></category>

		<guid isPermaLink="false">http://wongiseng.homelinux.org/complications-of-extracting-protein-names</guid>
		<description><![CDATA[I tried applyCRF, experiences:

The list output combines the &#8216;B-Protein&#8217; and &#8216;I-Protein&#8217; of the other output formats
The resulting names are often more than a gene symbol (e.g. &#8216;NRG1 gene&#8217;)
Sometimes a B-Protein is truncated and a gene symbol extension is the first I-Protein (e.g. &#8216;DNAse&#8217;, &#8216;I&#8217;)
B-Proteins can have little mistakes (e.g. &#8216;STA1,&#8217;), with regexp I clear comma&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p>I tried applyCRF, experiences:</p>
<ul>
<li>The list output combines the &#8216;B-Protein&#8217; and &#8216;I-Protein&#8217; of the other output formats</li>
<li>The resulting names are often more than a gene symbol (e.g. &#8216;NRG1 gene&#8217;)</li>
<li>Sometimes a B-Protein is truncated and a gene symbol extension is the first I-Protein (e.g. &#8216;DNAse&#8217;, &#8216;I&#8217;)</li>
<li>B-Proteins can have little mistakes (e.g. &#8216;STA1,&#8217;), with regexp I clear comma&#8217;s and points at the end of B-Proteins</li>
<li>Conclusion: I do everything: list output (B-Protein+I-Protein), B-Protein only (with and without [,.]) before validation</li>
</ul>
<p>I tested applyCRF with Matijn Schuemie&#8217;s UniProt service. The purpose is to filter our false positives. I found that</p>
<ul>
<li>Many proteins names give no result; these are generally not human (Scheumie&#8217;s service is human only)</li>
<li>The DDBJ service, GetGene_DDBJentry or GetProd_DDBJentry from Japan can also be used
<ul>
<li>works on many organisms</li>
<li>multiple results: from which organism is the extracted protein name?</li>
<li>more unique names could be retrieved if the organism is know</li>
</ul>
</li>
<li>Conclusion: organism is essential to identify a protein name: a gene symbol is (more or less) unique within a species. Add organism name to document search? What to do when id is not unique?</li>
</ul>
<p>ToDo</p>
<ul>
<li>Compare NER and applyCRF in one workflow</li>
<li>Consider taking up organism name in document search query.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://adaptivedisclosure.org/complications-of-extracting-protein-names/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>

