From: "Yaakov Chaikin"
To: uima-user@incubator.apache.org
Date: Mon, 16 Jun 2008 15:53:57 -0400
Subject: Content segmentation
Message-ID: <15378a7f0806161253x3b990bc4nae4dbdd2fed78b05@mail.gmail.com>

Hi,

I wanted to find out if UIMA has any concept of content segmentation. Some of the analysis processing is very memory and CPU intensive and if the content happens to be huge (like a book), it will bring the server to a crawl.

So, I was wondering if the UIMA framework has any notion of breaking up the content into smaller segments.

Thanks,
Yaakov.