Sunday, April 1, 2018

FW: How do I create a schema file for FIX data in Solr

-----Original Message-----
From: Raymond Xie [mailto:xie3208080@gmail.com]
Sent: 31 March 2018 23:52
To: solr-user@lucene.apache.org; Hui Xie <xie3208080@gmail.com>
Subject: How do I create a schema file for FIX data in Solr

Hello,

I just started using Solr to create a Searching function on our existing
data.

The existing data is in FIX format sample as below:

8=FIX.4.4 9=653 35=RIO 1=TEST 11=337912000000002 38=1 44=2.0 39=A 40=2
49=VIPER 50=JPNIK01 54=1 55=JNI253D8.OS 56=XSVC 59=0 75=20180350 100=XOSE
10039=viperooe 10241=viperooe 150=A 372=D
122=20180320-08:08:35.038 10066=20180320-08:08:35.038
10436=20180320-08:08:35.038 202=25375.0 52=20180320-08:08:35.088
60=20180320-08:08:35.088 10071=20180320-08:08:35.088
11210=337912000000002 37=337912000000002 10184=337912000000002 201=1
29=4 10438=RIO.4.5 10005=178 10515=178 10518=178 581=13 660=102 1133=G 528=P
10104=Y 10202=APMKTMAKING 10208=APAC.VIPER.OOE 10217=Y 10292=115
11032=-1 382=0 10537=XOSE 15=JPY 167=OPT 48=179492540 455=179492540
22=101 456=101 151=1.0 421=JPN 10=200


all the red tags (I didn't mark all of them) are fields with definition from
FIX standard, I need to create index on all the tags, how do I start?

Thank you very much.

*------------------------------------------------*
*Sincerely yours,*


*Raymond*

On Sat, Mar 31, 2018 at 12:24 AM, Randy Fradin <randy.fradin@gmail.com>
wrote:

> I have a SolrCloud cluster (version 6.5.1) with around 3300 cores per
> instance. I've been investigating what is driving heap utilization
> since it is higher than I expected. I took a heap dump and found the
> largest driver of heap utilization is the array of VersionBucket
> objects in the org.apache.solr.update.VersionInfo class. The array is
> size 65536 and there is one per SolrCore instance. Each instance of
> the array is 1.8MB so the aggregate size is 6GB in heap.
>
> I understand from reading the discussion in SOLR-6820 that 65536 is
> the recommended default for this setting now because it results in
> higher document write rates than the old default of 256. I would like
> to reduce my heap utilization and I'm OK with somewhat slower document
> writing throughput. My question is, it is safe to reduce the value of
> numVersionBuckets on all of my existing cores without reindexing my
> data?
>
> My solrconfig.xml contains this for all of my collections:
>
> <updateHandler class="solr.DirectUpdateHandler2">
> <updateLog>
> <str name="dir">${solr.ulog.dir:}</str>
> <int name="numVersionBuckets">${solr.ulog.numVersionBuckets:
> 65536}</int>
> </updateLog>
> </updateHandler>
>
> Assuming it is safe to change, can I just add a vm arg to the Solr
> process like "-Dsolr.ulog.numVersionBuckets=256" to override the value
> for all cores at once? Or do I have to change and re-upload the
> solrconfig.xml files and reload the cores?
>
> Thanks
>

No comments:

Post a Comment