<?xml version="1.0" encoding="utf-8" ?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">

    <title type="text">Infobright.org Forums</title>
    <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/" />
    <link rel="self" type="application/atom+xml" href="http://www.infobright.org/Forums/atom/" />
    <updated></updated>
    <rights>Copyright (c) 2012</rights>
    <generator uri="http://expressionengine.com/" version="1.6.7">ExpressionEngine</generator>
    <id>tag:infobright.org,2012:02:07</id>


    <entry>
      <title>Load Data Into Infobright Community Edition using Third Party ETL Tool Like Talend</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/viewthread/2741/" />      
      <id>tag:infobright.org,2012:Forums/viewthread/.2741</id>
      <published>2012-02-03T04:44:51Z</published>
      <updated></updated>
      <author><name>Rajeev Mehta</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi All,</p>

<p>Can any one tell me How to load data Into Infobright Community Edition db using third party ETL tool like Talend ?</p>

<p>Regards,<br />
Rajeev Kumar Mehta
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Best ETL tool for Infobright</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/viewthread/1635/" />      
      <id>tag:infobright.org,2010:Forums/viewthread/.1635</id>
      <published>2010-07-05T20:01:15Z</published>
      <updated></updated>
      <author><name>Saravana7</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi All,</p>

<p>Can you suggest me the <b>BEST</b> ETL tool which can be used to load data from SQL Server/DB2/Oracle (in scheduled manner) into Infobright?
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Use Infobright Loader and ignore error</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/viewthread/2708/" />      
      <id>tag:infobright.org,2012:Forums/viewthread/.2708</id>
      <published>2012-01-10T11:51:57Z</published>
      <updated></updated>
      <author><name>gre_ubiqube</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi all,</p>

<p>I&#8217;m trying to load a file using infobright loader 
</p><div class="codeblock"><code><span style="color: #000000">
<span style="color: #0000BB">LOAD&nbsp;DATA&nbsp;INFILE&nbsp;</span><span style="color: #DD0000">'/tmp/filetodump'&nbsp;</span><span style="color: #0000BB">INTO&nbsp;TABLE&nbsp;data_20120110&nbsp;FIELDS&nbsp;TERMINATED&nbsp;BY&nbsp;</span><span style="color: #DD0000">','&nbsp;</span><span style="color: #0000BB">ENCLOSED&nbsp;BY&nbsp;</span><span style="color: #DD0000">''</span><span style="color: #007700">;&nbsp;</span>
</span>
</code></div><p>
I got the following error 
</p><div class="codeblock"><code><span style="color: #000000">
<span style="color: #0000BB">ERROR&nbsp;2&nbsp;</span><span style="color: #007700">(</span><span style="color: #0000BB">HY000</span><span style="color: #007700">):&nbsp;</span><span style="color: #0000BB">Wrong&nbsp;data&nbsp;</span><span style="color: #007700">or&nbsp;</span><span style="color: #0000BB">column&nbsp;definition</span><span style="color: #007700">.&nbsp;</span><span style="color: #0000BB">Row</span><span style="color: #007700">:&nbsp;</span><span style="color: #0000BB">82</span><span style="color: #007700">,&nbsp;</span><span style="color: #0000BB">field</span><span style="color: #007700">:&nbsp;</span><span style="color: #0000BB">36.&nbsp;</span>
</span>
</code></div><p>
If i check my file, there is indeed an error at row 82. However, i&#8217;d like to continue to load other part of my file. So my question is, do you think it&#8217;s possible to load a file, ignoring row in errors ?<br />
I know it&#8217;s possible with oracle.
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>No Network in VM ICE + Talend + JasperSoft</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/viewthread/2742/" />      
      <id>tag:infobright.org,2012:Forums/viewthread/.2742</id>
      <published>2012-02-03T06:01:38Z</published>
      <updated></updated>
      <author><name>Rajeev Mehta</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi All,</p>

<p>Can any one tell me how to browse internet on VM ICE + Talend + JasperSoft.<br />
Any configuration needed ?<br />
I have install this VM on windows XP machine with internet connection but i am unable to browse internet in the VM.</p>

<p>Thanks,<br />
Rajeev Kumar Mehta
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>ELT vs ETL</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/viewthread/2702/" />      
      <id>tag:infobright.org,2012:Forums/viewthread/.2702</id>
      <published>2012-01-02T08:34:02Z</published>
      <updated></updated>
      <author><name>Bernd Hofner</name></author>
      <content type="html">
      <![CDATA[
        <p>Hello,</p>

<p>Are there any experiences or best practices regarding the transformation of large data sets (~100Mio records)?</p>

<p>With an Oracle DB, I made good experiences with Load and then Transform instead of doing an external Transform and then the Load. The &#8220;trick&#8221; was using cascaded &#8220;create table as select&#8221; statements to join and filter a &#8220;base input table&#8221; in consecutive steps with data from several other, large tables (denormalizing and filtering).</p>

<p>So in effect I copied data from table to table within the database.</p>

<p>Is that also a good idea with Infobright (or columnar storage in general)?<br />
Or is it faster to preprocess the data externally (e.g. using Pentaho/Kettle &#8220;table input&#8221; and &#8220;merge join&#8221; transformation steps)?</p>

<p>Best regards,</p>

<p>Bernd
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Example Using Core Connector Lib</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/viewthread/2696/" />      
      <id>tag:infobright.org,2011:Forums/viewthread/.2696</id>
      <published>2011-12-28T09:23:50Z</published>
      <updated></updated>
      <author><name>Bernd Hofner</name></author>
      <content type="html">
      <![CDATA[
        <p>Hello,</p>

<p>I need to load delimited files that have:</p>

<p>* two character column delimiters<br />
* three character record delimiters (none of them CR or LF)<br />
* contain potentially CR/LF characters in the data fields</p>

<p>The given LOAD command can&#8217;t handle this.<br />
Pentaho/Kettle can&#8217;t handle it either (CR or LF is a &#8220;must have&#8221; as record delimiter).</p>

<p>Currently I need to run under Windows.</p>

<p>So I plan to write a custom loader/filter for these (large) files (In Java or C++).<br />
I don&#8217;t want to create intermediate files and thought it might be a good idea to use the core connector lib.<br />
I had a short look into the source but couldn&#8217;t find a starting point.</p>

<p>Is there an example on how to use/call the lib?</p>

<p>I think InfobrightNamedPipeLoader is the &#8220;entry point&#8221; - but I can&#8217;t see where to &#8220;feed in&#8221; the my filtered text data&#8230;</p>

<p>Best regards,</p>

<p>Bernd
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Load Data infile unable to load a file of type binary</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/viewthread/2641/" />      
      <id>tag:infobright.org,2011:Forums/viewthread/.2641</id>
      <published>2011-11-16T18:28:56Z</published>
      <updated></updated>
      <author><name>sumitkk</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi,<br />
I have a text file produced in windows that i am trying to load into infobright using the loader.<br />
Its a tab-delimited text file of type binary(when i run file -i filename ) i get this answer. When i try to load it using the loader it fails and mentions wrong type of column definition. although all columns in the table are of type varchar.<br />
Could someone help me with this<br />
infobright has been installed in red hat linux os</p>

<p>Regards<br />
Sumit
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>High CPU for several hrs on LOAD DATA INFILE</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/viewthread/2617/" />      
      <id>tag:infobright.org,2011:Forums/viewthread/.2617</id>
      <published>2011-11-01T17:31:47Z</published>
      <updated></updated>
      <author><name>mtvalek</name></author>
      <content type="html">
      <![CDATA[
        <p>I have a data load which is consuming 99% CPU for several hours now. Cannot determine what the underlying issue is. Is this usually an indicator of a particular problem. I have not run into this problem prior to creating new extract files based on a load of new data to the source system. These are pipe-delimited files. I have confirmed layout is correct.
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Parallel Load</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/viewthread/2612/" />      
      <id>tag:infobright.org,2011:Forums/viewthread/.2612</id>
      <published>2011-10-29T00:20:27Z</published>
      <updated>2011-10-29T00:35:16Z</updated>
      <author><name>sriwisnu</name></author>
      <content type="html">
      <![CDATA[
        <p>How to built parallel load (Distributed Load Processing) on infobright ?.. i need faster&#8230;.<br />
Because I have loaded 100 mio records with record length about 305 byte/record and  conducted  using single load, need  arround 37 minutes, as below:<br />
My Server: Sun xFire x2200 with 8GB RAM</p>

<p>mysql&gt; set @bh_dataformat = &#8216;txt_variable&#8217;;</p>

<p>mysql&gt; load data infile &#8216;/export/home/mscs.100mio.txt&#8217; into table msc fields terminated by &#8216;|&#8217;;<br />
Query OK, 100000000 rows affected (37 min 30.84 sec)<br />
Records: 100000000  Deleted: 0  Skipped: 0  Warnings: 0</p>

<p>please advise..</p>

<p>BR<br />
Sriwisnu
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Infobright 4.0.4 and talend on Windows server 2008 R2</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/viewthread/2579/" />      
      <id>tag:infobright.org,2011:Forums/viewthread/.2579</id>
      <published>2011-10-03T11:23:30Z</published>
      <updated></updated>
      <author><name>Jon Finerfrock</name></author>
      <content type="html">
      <![CDATA[
        <p>last week we spent an inordinate amount of time trying to fix an issue with Talend and IB 4.0.4  on a Win 2008 R2 Server.&nbsp; we could not get the IB fast loader to load data in Talend.&nbsp; we kept getting an access denied message for the named pipes.&nbsp;  long story short we found the solution</p>

<p><br />
Something we did not know was that even though We were set up as administrators on the box you must tell Windows Server 2008 explicitly to run the application (Talend) as the Administrator.&nbsp; You can do this as a one time thing by right clicking on the launch icon for Talend and clicking &#8220;run as administrator&#8221;. (you will get a dialog box asking if you are really sure you want to do this)</p>

<p>OR</p>

<p>if you want to always run the application as an administrator then right click on the launch icon go to properties. On the dialog box that opens, click on the compatibility tab and at the bottom, check the &#8220;Run this program as an administrator&#8221; box.&nbsp; even when you run it this way, you will get the &#8220;are you really sure&#8221; box.</p>

<p><br />
Jon Finerfrock
</p>
      ]]>
      </content>
    </entry>


</feed>
