From: Gian Maria Ricci - aka Alkampfer
To: solr-user@lucene.apache.org
Subject: Best practice for incremental Data Import Handler
Date: Mon, 14 Dec 2015 18:29:34 +0100

Hi,

I just want some feedback on best practices for running an incremental DIH. In recent years I have always preferred a dedicated application that pushes data into ElasticSearch / Solr, but now I am in a situation where we are forced to use DIH.

I have several SQL Server databases with a column of type timestamp (I'm trying to understand whether it is possible to use a standard DateTime column instead).

In the past I wrote a very simple C# routine that executes these macro steps:

1) Query Solr to check whether the DIH is already running (to avoid problems if multiple instances fire together)
2) Query Solr to get the document with the highest timestamp value
3) Launch DIH, passing the highest timestamp value to do an incremental population (greater than or equal)
4) Monitor DIH and wait for it to finish.

I have never had problems with this approach, but I am wondering whether there is a better approach than a custom routine that manages running DIH. I am also in a situation where we are not allowed to run C# code, so we would have to rewrite that simple program in Node.js or a plain bash shell script. My aim is not to reinvent the wheel :)

Thanks for any suggestions you can give me.
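For reference, the four macro steps above can be sketched as a bash routine against the DIH HTTP API. This is only a sketch under assumptions: the core name (mycore), the timestamp field name (last_modified), and the request parameter name (last_ts) are all placeholders, not names from my actual setup. The parameter passed on the URL is readable inside data-config.xml as ${dataimporter.request.last_ts}.

```shell
#!/usr/bin/env bash
# Sketch of the incremental-DIH routine. Assumed names: core "mycore",
# timestamp field "last_modified", request parameter "last_ts".
SOLR="${SOLR_URL:-http://localhost:8983/solr/mycore}"

# Pull a string field value out of a JSON response (avoids a jq dependency).
json_field() { grep -o "\"$1\":\"[^\"]*\"" | head -1 | cut -d'"' -f4; }

dih_status() { curl -s "$SOLR/dataimport?command=status&wt=json" | json_field status; }

run_incremental_import() {
  # 1) Bail out if another import is already running.
  if [ "$(dih_status)" = "busy" ]; then
    echo "DIH already running, exiting."
    return 0
  fi

  # 2) Ask Solr for the highest timestamp currently indexed.
  local last
  last=$(curl -s "$SOLR/select?q=*:*&sort=last_modified+desc&rows=1&fl=last_modified&wt=json" \
         | json_field last_modified)

  # 3) Launch an incremental import, passing the timestamp as a request
  #    parameter (visible in data-config.xml as ${dataimporter.request.last_ts}).
  curl -s "$SOLR/dataimport?command=full-import&clean=false&last_ts=$last" >/dev/null

  # 4) Poll the status endpoint until the handler goes idle again.
  while [ "$(dih_status)" = "busy" ]; do
    sleep 5
  done
  echo "Import finished."
}

# Run only when invoked explicitly, e.g.:  ./incremental-dih.sh run
if [ "${1:-}" = "run" ]; then
  run_incremental_import
fi
```

The clean=false flag keeps the existing index intact so the import only adds or updates documents; the SQL in data-config.xml would filter on the passed timestamp with a greater-than-or-equal condition.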
--
Gian Maria Ricci
Cell: +39 320 0136949
