hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Lucene-hadoop Wiki] Update of "Hbase/HbaseShell/Ideas" by udanax
Date Thu, 11 Oct 2007 03:23:20 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.

The following page has been changed by udanax:
http://wiki.apache.org/lucene-hadoop/Hbase/HbaseShell/Ideas

The comment on the change is:
moved to ShellPlans

------------------------------------------------------------------------------
- [[TableOfContents(4)]]
+ deleted
  
- ----
- = Some Ideas for Hbase Shell =
- Volunteers and any comments are welcomed. 
- 
- == Hbase JDBC driver ==
- I've started to think about Java Database Connectivity (JDBC) driver for use with Hbase.

- [[BR]]I am thinking of integrating Zeroboard with this work in progress for a test case.
-- [:udanax:Edward yoon]
- 
-  * Zeroboard is one of the most popular PHP web boards in Asia. 
-   * Our Test & Evaluation Labs : http://labs.zeroboard.com/ 
- 
- 
- === Resources ===
- 
-  * http://java.sun.com/j2se/1.4.2/docs/guide/jdbc/getstart/GettingStartedTOC.fm.html
- 
- = Some Other Stuff for Hadoop project =
- 
- == Korean Morphological Analyzer for lucene ==
- 
-  * Work in progress with NLP lab.
- 
- {{{
- package org.apache.lucene;
- 
- import java.io.IOException;
- import java.io.StringReader;
- import java.util.ArrayList;
- import java.util.List;
- 
- import org.apache.lucene.analysis.Analyzer;
- import org.apache.lucene.analysis.Token;
- import org.apache.lucene.analysis.TokenStream;
- import org.apache.lucene.analysis.kr.KoreanAnalyzer;
- import org.apache.lucene.analysis.kr.KoreanMorphemeAnalyzer;
- 
- public class Test {
-   public static void main(String[] arg) throws IOException {
-     Analyzer a = new KoreanMorphemeAnalyzer();
-     String[] result = getMorphemeArray(a, "조선 제4대 임금 세종이 훈민정음이라는
이름으로 창제하였다");
- 
-     for (int i = 0; i < result.length; i++) {
-       System.out.println(result[i]);
-     }
-     
-     a = new KoreanAnalyzer();
-     result = getMorphemeArray(a, "조선 제4대 임금 세종이 훈민정음이라는
이름으로 창제하였다");
- 
-     for (int i = 0; i < result.length; i++) {
-       System.out.println(result[i]);
-     }
-   }
- 
-   private static String[] getMorphemeArray(Analyzer a, String string)
-       throws IOException {
-     List<String> tmp = new ArrayList<String>();
-     TokenStream ts = a.tokenStream("dummy", new StringReader(string));
- 
-     for (int i = 0; true; i++) {
-       Token t = ts.next();
-       if (t == null) {
-         break;
-       } else {
-         tmp.add(t.toString());
-       }
-     }
- 
-     return tmp.toArray(new String[] {});
-   }
- }}}
- 

Mime
View raw message