From: "Chen, Pei"
Reply-To: dev@ctakes.apache.org
Subject: Re: Integration of Tika with cTAKES
Date: Sun, 7 Jun 2015 17:04:10 +0000
Message-ID: <306914E3-1721-4C71-ACDC-9C3EB5B70AA5@childrens.harvard.edu>

This looks awesome.
Perhaps we can reuse the Tika server on the cTAKES demo VM.

Sent from my iPhone

> On Jun 6, 2015, at 8:40 PM, jay vyas wrote:
>
> This is awesome; thanks!
>
> For some of the new cTAKES projects where folks are aiming at using it
> with big data tooling, the Tika abstraction might be super useful.
> On Jun 6, 2015 8:19 PM, "Mattmann, Chris A (3980)" <
> chris.a.mattmann@jpl.nasa.gov> wrote:
>
>> Hey cTAKES peeps!
>>
>> We went ahead and integrated Tika with cTAKES for a project I'm
>> working on at JPL. It will be part of the 1.9 release of Tika. You
>> can check it out here:
>>
>> https://urldefense.proofpoint.com/v2/url?u=https-3A__wiki.apache.org_tika_cTAKESParser&d=BQIFaQ&c=qS4goWBT7poplM69zy_3xhKwEW14JZMSdioCoppxeFU&r=huK2MFkj300qccT8OSuuoYhy_xEYujfPwiAxhPVz5WY&m=L070DL_WFb_1U_8jGdAbnv_Ggx5mnsTfV4Jba6oNNU8&s=vafA1g4UuwgflDIIfKBwceFE2mgCY3VVMJ_A1PaUPRM&e=
>>
>> Feedback welcomed. cTAKES is rad!
>>
>> Cheers,
>> Chris
>>
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Chief Architect
>> Instrument Software and Science Data Systems Section (398)
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 168-519, Mailstop: 168-527
>> Email: chris.a.mattmann@nasa.gov
>> WWW: https://urldefense.proofpoint.com/v2/url?u=http-3A__sunset.usc.edu_-7Emattmann_&d=BQIFaQ&c=qS4goWBT7poplM69zy_3xhKwEW14JZMSdioCoppxeFU&r=huK2MFkj300qccT8OSuuoYhy_xEYujfPwiAxhPVz5WY&m=L070DL_WFb_1U_8jGdAbnv_Ggx5mnsTfV4Jba6oNNU8&s=gFv8mVTL-qCTpFgkWRIC8vlrkwOdiXHUWq2xtCUTI48&e=
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Associate Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++