From: Milen
To: user@uima.apache.org
Subject: Collecting all documents from a directory
Date: Sat, 6 Nov 2010 13:10:37 +0000 (UTC)

Hello,

I am trying to cluster the files in a directory. I have already implemented the clustering part using the Mallet library, and it works when I cluster the files directly. However, I have to use UIMA, so I wrote a collection reader that reads the files and annotates them as documents.

The clustering step needs a collection of documents as its input, and I cannot manage to provide one: the collection reader delivers the files one at a time. What should I implement to collect all the files, for example in an array, so that I can then use them together?

Regards,
Milen
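
A minimal sketch of one way to approach the question above, not a definitive answer: accumulate each document as it arrives, and run the clustering only once the framework signals that the collection is finished. In UIMA the two corresponding hooks would be `process(JCas)` and `collectionProcessComplete()` on an annotator extending `JCasAnnotator_ImplBase`. The class below is a plain-Java stand-in with no UIMA dependency; `DocumentAccumulator` and its callback are hypothetical names used only for illustration, and the Mallet call is left out because its input format is not described here.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.function.Consumer;

// Hypothetical stand-in for a UIMA annotator or CAS consumer.
// It shows only the accumulate-then-finish pattern, not the UIMA API itself.
final class DocumentAccumulator {
    // Texts gathered so far, one entry per document delivered by the reader.
    private final List<String> documents = new ArrayList<>();
    // Called once with the whole collection, e.g. to start the clustering.
    private final Consumer<List<String>> onComplete;

    DocumentAccumulator(Consumer<List<String>> onComplete) {
        this.onComplete = onComplete;
    }

    // Plays the role of process(JCas): called once per document.
    // Only the text is stored, because the framework may reuse the CAS
    // object for the next document.
    void process(String documentText) {
        documents.add(documentText);
    }

    // Plays the role of collectionProcessComplete(): called once, after
    // the last document. Hands a read-only copy to the callback and resets.
    void collectionProcessComplete() {
        onComplete.accept(Collections.unmodifiableList(new ArrayList<>(documents)));
        documents.clear();
    }
}
```

In use, the reader would still deliver files one by one. Each one is passed to `process`, and the callback given to the constructor receives the full list once `collectionProcessComplete` is called. Keeping every document in memory is an assumption that only holds for collections small enough to fit.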