lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Howe, David" <>
Subject RE: Index size increases disproportionately to size of added field when indexed=false
Date Fri, 16 Feb 2018 20:51:02 GMT

Hi Erick,

Thinking some more about the differences between the two sort orders has suggested another
possibility.  We also have a geo spatial field defined in the index:

  echo "$(date) Creating geoLocation field"
  curl -X POST -H 'Content-type:application/json' --data-binary '{
  }' http://localhost:8983/solr/address/schema

One of the differences between the two sort orders is that when the data is sorted by locality
and post code, it means that addresses that are close to each other will be sorted together
as both locality and postcode have geographic meaning.  So when they are indexed, they will
be indexed in groups of addresses that are quite near to each other.

When the data is sorted by DPID, the order is near random as the dpid has no meaning at all,
so the geo location sequence should be random as well.

I don't have time to test this at the moment, as I need to get my project back on track after
chasing this performance issue but it might ring a bell with somebody.



David Howe
Java Domain Architect
Postal Systems
Level 16, 111 Bourke Street Melbourne VIC 3000

T  0391067904

M  0424036591



Australia Post is committed to providing our customers with excellent service. If we can assist
you in any way please telephone 13 13 18 or visit our website.

The information contained in this email communication may be proprietary, confidential or
legally professionally privileged. It is intended exclusively for the individual or entity
to which it is addressed. You should only read, disclose, re-transmit, copy, distribute, act
in reliance on or commercialise the information if you are authorised to do so. Australia
Post does not represent, warrant or guarantee that the integrity of this email communication
has been maintained nor that the communication is free of errors, virus or interference.

If you are not the addressee or intended recipient please notify us by replying direct to
the sender and then destroy any electronic or paper copy of this message. Any views expressed
in this email communication are taken to be those of the individual sender, except where the
sender specifically attributes those views to Australia Post and is authorised to do so.

Please consider the environment before printing this email.
View raw message