From dev-return-357382-archive-asf-public=cust-asf.ponee.io@lucene.apache.org Sat Jun 1 16:40:04 2019 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [207.244.88.153]) by mx-eu-01.ponee.io (Postfix) with SMTP id 1301C18061A for ; Sat, 1 Jun 2019 18:40:03 +0200 (CEST) Received: (qmail 3182 invoked by uid 500); 1 Jun 2019 16:40:02 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 3055 invoked by uid 99); 1 Jun 2019 16:40:02 -0000 Received: from mailrelay1-us-west.apache.org (HELO mailrelay1-us-west.apache.org) (209.188.14.139) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 01 Jun 2019 16:40:02 +0000 Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 43E09E0A4B for ; Sat, 1 Jun 2019 16:40:01 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 984F12459B for ; Sat, 1 Jun 2019 16:40:00 +0000 (UTC) Date: Sat, 1 Jun 2019 16:40:00 +0000 (UTC) From: "Namgyu Kim (JIRA)" To: dev@lucene.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (LUCENE-8817) Combine Nori and Kuromoji DictionaryBuilder MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Namgyu Kim created LUCENE-8817: ---------------------------------- Summary: Combine Nori and Kuromoji DictionaryBuilder Key: LUCENE-8817 URL: https://issues.apache.org/jira/browse/LUCENE-8817 Project: Lucene - Core Issue Type: New Feature Reporter: Namgyu Kim This issue is related to LUCENE-8816. Currently Nori and Kuromoji Analyzer use the same dictionary structure. (MeCab) If we make combine DictionaryBuilder, we can reduce the code size. But this task may have a dependency on the language. (like HEADER string in BinaryDictionary and CharacterDefinition, methods in BinaryDictionaryWriter, ...) On the other hand, there are many overlapped classes. The purpose of this patch is to provide users of Nori and Kuromoji with the same system dictionary generator. It may take some time because there is a little workload. The work will be based on the latest master, and if the LUCENE-8816 is finished first, it will pull the latest code and proceed. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org