lucene-dev mailing list archives

From Erick Erickson
Subject More on "ant regenerate" target:
Date Wed, 04 Dec 2019 19:32:03 GMT
I have the git pull working for fetching a particular revision of nfkc.txt and the like. Now
TestICUFoldingFilterFactory fails. Here's what I could find on that topic:
  public static final Normalizer2 NORMALIZER = Normalizer2.getInstance(
    // TODO: if the wrong version of the ICU jar is used, loading these data files may give a strange error.
    // maybe add an explicit check?
    "utr30", Normalizer2.Mode.COMPOSE);
eventually calls:
 public Normalizer2Impl load(ByteBuffer bytes) {
    try {
      this.dataVersion = ICUBinary.readHeaderAndDataVersion(bytes, 1316121906, IS_ACCEPTABLE);
which throws
Caused by: ICU data file error:
Header authentication failed, please check if you have a valid ICU data file; data format 4e726d32, format version

0x4e726d32==1316121906, so the data format looks ok to my uninformed eye.
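The comparison above can be double-checked with a few lines of Java. This is only a sketch: the class and method names (DataFormatCheck, tag) are made up for illustration and are not part of Lucene or ICU. It also prints the four bytes as ASCII, which makes the constant easier to recognize.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of Lucene or ICU4J.
class DataFormatCheck {
    // Interpret a 32-bit data format value as four ASCII characters (big-endian).
    static String tag(int dataFormat) {
        byte[] b = ByteBuffer.allocate(4).putInt(dataFormat).array();
        return new String(b, StandardCharsets.US_ASCII);
    }

    public static void main(String[] args) {
        int expected = 1316121906;   // constant passed to readHeaderAndDataVersion
        int reported = 0x4e726d32;   // value printed in the error message
        System.out.println(expected == reported); // true
        System.out.println(tag(reported));        // Nrm2
    }
}
```

So the four data format bytes spell "Nrm2" and match the constant in the load() call.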

The ICU jar I have is icu4j-62.1.jar.
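Since the error text reports both a data format and a format version, it may help to print those fields straight from the generated data file and compare them with what the jar expects. The following is a rough sketch, assuming the usual ICU binary header layout (magic bytes 0xda 0x27 at offsets 2 and 3, data format at offsets 12 to 15, format version at offsets 16 to 19). The class name IcuHeaderDump is hypothetical, and the file to inspect is passed as an argument:

```java
import java.nio.ByteBuffer;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical diagnostic, not part of Lucene or ICU4J.
// Assumes the standard ICU data header layout described in the text above.
class IcuHeaderDump {
    static String describe(ByteBuffer bytes) {
        if (bytes.remaining() < 24) {
            throw new IllegalArgumentException("too short to hold an ICU data header");
        }
        int p = bytes.position();
        // Magic bytes identify the file as ICU binary data.
        boolean magicOk = bytes.get(p + 2) == (byte) 0xda && bytes.get(p + 3) == (byte) 0x27;
        // Data format: four bytes, e.g. 4e 72 6d 32 for "Nrm2".
        int dataFormat = ((bytes.get(p + 12) & 0xff) << 24)
                       | ((bytes.get(p + 13) & 0xff) << 16)
                       | ((bytes.get(p + 14) & 0xff) << 8)
                       |  (bytes.get(p + 15) & 0xff);
        // Format version: four unsigned bytes printed as a.b.c.d.
        return String.format("magic ok=%b, data format %08x, format version %d.%d.%d.%d",
                magicOk, dataFormat,
                bytes.get(p + 16) & 0xff, bytes.get(p + 17) & 0xff,
                bytes.get(p + 18) & 0xff, bytes.get(p + 19) & 0xff);
    }

    public static void main(String[] args) throws Exception {
        // args[0]: path to the generated binary data file to inspect.
        byte[] data = Files.readAllBytes(Paths.get(args[0]));
        System.out.println(describe(ByteBuffer.wrap(data)));
    }
}
```

If the data format matches but the format version differs from what the installed ICU jar accepts, that would be one possible explanation for the header check failing.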

I looked at the nfc* files that are now fetched from github and at least ./lucene/analysis/icu/src/data/utr30/nfc.txt is identical.

I’ll get back to this later this afternoon; meanwhile, any pointers?