<?xml version="1.0" encoding="utf-8" ?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">

    <title type="text">Infobright.org Forums</title>
    <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/register/" />
    <link rel="self" type="application/atom+xml" href="http://www.infobright.org/Forums/register/atom/" />
    <updated></updated>
    <rights>Copyright (c) 2013</rights>
    <generator uri="http://expressionengine.com/" version="1.6.7">ExpressionEngine</generator>
    <id>tag:infobright.org,2013:06:19</id>


    <entry>
      <title>query taking 3&#45;4 minutes in Enterprsie Edition ( 95 million data )</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/register/viewthread/3410/" />      
      <id>tag:infobright.org,2013:Forums/register/viewthread/.3410</id>
      <published>2013-05-25T02:46:13Z</published>
      <updated></updated>
      <author><name>kannan</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi All</p>

<p>&nbsp;  I am using Infobright Enterprise Edition for a while ,and it is performing good ,there is table called WORLDBASE which is having 95 million data,all searches are happening in this WORLDBASE table only (which is taking max of 50 secs),there is new requirement to implement radius search in application in the table WORLDBASE which is having 95 million data ...</p>

<p>Implemented the Radius search using latitude ,langitude in WORLDBASE Table ,Result output is fine but the performance is too slow ,it is taking 3-4 minutes for the Radius search ,Below mentioning all the details ,please suggest me how to improve the performance .</p>

<p>WORLDBASE Table :</p>

<p>CREATE TABLE `WORLDBASE` (<br />
&nbsp; `COMPANYNAME` varchar(120) DEFAULT NULL,<br />
&nbsp; `INTELLECTID` int(20) DEFAULT NULL,<br />
&nbsp; `DUNSNO` int(9) DEFAULT NULL,<br />
&nbsp; `COMPANYTYPECODE` tinyint(1) DEFAULT NULL,<br />
&nbsp; `LOCATIONTYPECODE` tinyint(1) DEFAULT NULL,<br />
&nbsp; `PHYSICALCOUNTRYCODE` int(4) DEFAULT NULL,<br />
&nbsp; `PHYSICALCOUNTYCODE` int(4) DEFAULT NULL,<br />
&nbsp; `PHYSICALSTATECODE` int(4) DEFAULT NULL,<br />
&nbsp; `PHYSICALCITYDIGIT` int(6) DEFAULT NULL,<br />
&nbsp; `WORLDREGIONCODE` tinyint(1) DEFAULT NULL,<br />
&nbsp; `USMETROPOLITANAREACODE` int(4) DEFAULT NULL,<br />
&nbsp; `SIC1DIGIT` int(1) DEFAULT NULL,<br />
&nbsp; `SIC2DIGIT` int(2) DEFAULT NULL,<br />
&nbsp; `SIC3DIGIT` int(3) DEFAULT NULL,<br />
&nbsp; `SIC4DIGIT` int(4) DEFAULT NULL,<br />
&nbsp; `SIC5DIGIT` int(5) DEFAULT NULL,<br />
&nbsp; `SIC6DIGIT` int(6) DEFAULT NULL,<br />
&nbsp; `SIC7DIGIT` int(7) DEFAULT NULL,<br />
&nbsp; `SIC8DIGIT` int(8) DEFAULT NULL,<br />
&nbsp; `NAICCODE1DIGIT` int(1) DEFAULT NULL,<br />
&nbsp; `NAICCODE2DIGIT` int(2) DEFAULT NULL,<br />
&nbsp; `NAICCODE3DIGIT` int(3) DEFAULT NULL,<br />
&nbsp; `NAICCODE4DIGIT` int(4) DEFAULT NULL,<br />
&nbsp; `NAICCODE5DIGIT` int(5) DEFAULT NULL,<br />
&nbsp; `NAICCODE6DIGIT` int(6) DEFAULT NULL,<br />
&nbsp; `SLS` bigint(20) DEFAULT NULL,<br />
&nbsp; `SLSCODE` tinyint(1) DEFAULT NULL,<br />
&nbsp; `EMPLOYEESALLSITES` int(10) DEFAULT NULL,<br />
&nbsp; `EMPLOYEESTHISSITE` int(10) DEFAULT NULL,<br />
&nbsp; `EMPLOYEESALLSITESCODE` tinyint(1) DEFAULT NULL,<br />
&nbsp; `SUBSIDIARYSTATUSCODE` tinyint(1) DEFAULT NULL,<br />
&nbsp; `EXPORTERINDICATORCODE` tinyint(1) DEFAULT NULL,<br />
&nbsp; `MINORITYOWNEDINDICATORCODE` tinyint(1) DEFAULT NULL,<br />
&nbsp; `OWNSRENTSINDICATORCODE` tinyint(1) DEFAULT NULL,<br />
&nbsp; `WOMANOWNEDINDICATORCODE` tinyint(1) DEFAULT NULL,<br />
&nbsp; `YEAR1EMPLOYEEGROWTH` int(10) DEFAULT NULL,<br />
&nbsp; `YEAR1SLSGROWTH` int(20) DEFAULT NULL,<br />
&nbsp; `FACILITYSIZE` int(10) DEFAULT NULL,<br />
&nbsp; `FAXNO` varchar(25) DEFAULT NULL,<br />
&nbsp; `LINEOFBUSINESS` varchar(128) DEFAULT NULL,<br />
&nbsp; `PHONENO` varchar(25) DEFAULT NULL,<br />
&nbsp; `URL` varchar(255) DEFAULT NULL,<br />
&nbsp; `FAMILYTREEDIASCODE` varchar(9) DEFAULT NULL,<br />
&nbsp; `GLOBALULTDUNSNO` int(8) DEFAULT NULL,<br />
&nbsp; `ZIPCODE` varchar(16) DEFAULT NULL,<br />
&nbsp; `CURRENCY` varchar(4) DEFAULT NULL,<br />
&nbsp; `PHYSICALADDRESS` varchar(150) DEFAULT NULL,<br />
&nbsp; `PHYSICALCITY` varchar(64) DEFAULT NULL,<br />
&nbsp; `PHYSICALCITYCODE` varchar(10) DEFAULT NULL,<br />
&nbsp; `PHYSICALSTATE` varchar(50) DEFAULT NULL,<br />
&nbsp; `PHYSICALSTATEPROVINCECODE` varchar(20) DEFAULT NULL,<br />
&nbsp; `PHYSICALCOUNTY` varchar(50) DEFAULT NULL,<br />
&nbsp; `PHYSICALZIPCODE` varchar(16) DEFAULT NULL,<br />
&nbsp; `PHYSICALCOUNTRY` varchar(100) DEFAULT NULL,<br />
&nbsp; `MAILINGADDRESS` varchar(150) DEFAULT NULL,<br />
&nbsp; `MAILINGCITY` varchar(64) DEFAULT NULL,<br />
&nbsp; `MAILINGCITYCODE` varchar(10) DEFAULT NULL,<br />
&nbsp; `MAILINGSTATE` varchar(50) DEFAULT NULL,<br />
&nbsp; `MAILINGSTATEPROVINCECODE` varchar(20) DEFAULT NULL,<br />
&nbsp; `MAILINGCOUNTY` varchar(50) DEFAULT NULL,<br />
&nbsp; `MAILINGZIPCODE` varchar(16) DEFAULT NULL,<br />
&nbsp; `MAILINGCOUNTRY` varchar(100) DEFAULT NULL,<br />
&nbsp; `EXPORTERINDICATOR` varchar(1) DEFAULT NULL,<br />
&nbsp; `LATITUDE` varchar(12) DEFAULT NULL,<br />
&nbsp; `LONGITUDE` varchar(12) DEFAULT NULL,<br />
&nbsp; `MANUFACTURERINDICATOR` varchar(1) DEFAULT NULL,<br />
&nbsp; `MARKETINGPRESCREENSCORE` varchar(1) DEFAULT NULL,<br />
&nbsp; `MINORITYOWNEDINDICATOR` varchar(1) DEFAULT NULL,<br />
&nbsp; `NAICCODE` varchar(6) DEFAULT NULL,<br />
&nbsp; `NAICDESC` varchar(150) DEFAULT NULL,<br />
&nbsp; `NAICS2` varchar(6) DEFAULT NULL,<br />
&nbsp; `NAICSDESC2` varchar(120) DEFAULT NULL,<br />
&nbsp; `NAICS3` varchar(6) DEFAULT NULL,<br />
&nbsp; `NAICSDESC3` varchar(120) DEFAULT NULL,<br />
&nbsp; `NAICS4` varchar(6) DEFAULT NULL,<br />
&nbsp; `NAICSDESC4` varchar(120) DEFAULT NULL,<br />
&nbsp; `NAICS5` varchar(6) DEFAULT NULL,<br />
&nbsp; `NAICSDESC5` varchar(120) DEFAULT NULL,<br />
&nbsp; `NAICS6` varchar(6) DEFAULT NULL,<br />
&nbsp; `NAICSDESC6` varchar(120) DEFAULT NULL,<br />
&nbsp; `SICCODE` varchar(8) DEFAULT NULL,<br />
&nbsp; `SICDESC` varchar(150) DEFAULT NULL,<br />
&nbsp; `PRIMARY8DIGITSIC` varchar(8) DEFAULT NULL,<br />
&nbsp; `PRIMARY8DIGITSICDESC` varchar(150) DEFAULT NULL,<br />
&nbsp; `SIC2` varchar(8) DEFAULT NULL,<br />
&nbsp; `SICDESC2` varchar(150) DEFAULT NULL,<br />
&nbsp; `SIC3` varchar(8) DEFAULT NULL,<br />
&nbsp; `SICDESC3` varchar(150) DEFAULT NULL,<br />
&nbsp; `SIC4` varchar(8) DEFAULT NULL,<br />
&nbsp; `SICDESC4` varchar(150) DEFAULT NULL,<br />
&nbsp; `SIC5` varchar(8) DEFAULT NULL,<br />
&nbsp; `SICDESC5` varchar(150) DEFAULT NULL,<br />
&nbsp; `SIC6` varchar(8) DEFAULT NULL,<br />
&nbsp; `SICDESC6` varchar(150) DEFAULT NULL,<br />
&nbsp; `COMPANYTYPE` varchar(10) DEFAULT NULL,<br />
&nbsp; `OWNSRENTSINDICATOR` varchar(1) DEFAULT NULL,<br />
&nbsp; `LOCATIONTYPE` varchar(1) DEFAULT NULL,<br />
&nbsp; `ACCOUNTINGFIRM` varchar(50) DEFAULT NULL,<br />
&nbsp; `WOMANOWNEDINDICATOR` varchar(1) DEFAULT NULL,<br />
&nbsp; `USMETROPOLITANAREA` varchar(4) DEFAULT NULL,<br />
&nbsp; `GLOBALULTDUNS` varchar(9) DEFAULT NULL,<br />
&nbsp; `GLOBALULTNAME` varchar(120) DEFAULT NULL,<br />
&nbsp; `IMMEDIATEPARENTDUNS` varchar(9) DEFAULT NULL,<br />
&nbsp; `IMMEDIATEPARENTNAME` varchar(120) DEFAULT NULL,<br />
&nbsp; `FAMILYTREEHIERARCHYCODE` varchar(2) DEFAULT NULL,<br />
&nbsp; `FAMILYTREEMEMBERCOUNT` int(10) DEFAULT NULL,<br />
&nbsp; `FAMILYTREERELATIONTYPE` varchar(1) DEFAULT NULL,<br />
&nbsp; `PHONEAREACODE` varchar(9) DEFAULT NULL,<br />
&nbsp; `SUBSIDIARYSTATUS` varchar(1) DEFAULT NULL,<br />
&nbsp; `STOCKEXCHANGE` varchar(50) DEFAULT NULL,<br />
&nbsp; `STOCKSYMBOL` varchar(12) DEFAULT NULL,<br />
&nbsp; `YEAROFFOUNDING` varchar(4) DEFAULT NULL,<br />
&nbsp; `TRADESTYLE` varchar(120) DEFAULT NULL,<br />
&nbsp; `URLTYPE` varchar(1) DEFAULT NULL,<br />
&nbsp; `MSANAME` varchar(100) DEFAULT NULL,<br />
&nbsp; `STATENAME` varchar(100) DEFAULT NULL,<br />
&nbsp; `COMPANYNAMESHORT` varchar(40) DEFAULT NULL<br />
) ENGINE=BRIGHTHOUSE DEFAULT CHARSET=utf8;</p>

<p>created a function for calculating a Distance in MYSQL :<br />
getdistance( has logic code for calculating distance);</p>

<p>SELECT COUNT(1) AS FIELDCOUNT FROM WORLDBASE WHERE getdistance(&#8216;33.560&#8217;, -112.131&#8217;, LATITUDE, LONGITUDE) &lt;= 10</p>

<p>taking 4 mins in WORLDBASE table,pls suggest
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Query performance</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/register/viewthread/3416/" />      
      <id>tag:infobright.org,2013:Forums/register/viewthread/.3416</id>
      <published>2013-05-29T14:12:20Z</published>
      <updated>2013-05-29T15:35:31Z</updated>
      <author><name>casatedu</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi,</p>

<p>I was doing some performance testing and was wondering if the results are in a bulk park with what is &#8220;theoretically&#8221; expected.</p>

<p>I create a table T1 with about 110 columns (halve of them Int and rest date, varchar and char . Most of chars and varchars are defined as a &#8216;lookup&#8217;.</p>

<p>Then I loaded bit over 216 Million records. Then I created exactly the some table T2 and loaded the same load file 15 times, so my T2 has more than 3.2 Billion records.</p>

<p>Than I executed queries:</p>

<p>SELECT B_Name, count(1) FROM T1 GROUP BY B_Name;</p>

<p>the 65 rows were returned after 6sec. <br />
(B_Name is a varchar lookup having 65 unique values)</p>

<p>Then I run</p>

<p>SELECT B_Name, count(1) FROM T2 GROUP BY B_Name;<br />
the 65 rows were returned after 93sec. </p>

<p>look like straight linear scalability (6 * 15 = 90)</p>

<p>So, once again my question is: are both numbers what is expected (actual numbers and the proportional difference)?<br />
Thanks</p>

<p>Hi again,</p>

<p>After I posted this entry I had another query running and this one surprised me a bit.<br />
The table from above has B_Name column that is: varchar(40) DEFAULT NULL COMMENT &#8216;lookup&#8217;,<br />
and B_Name_ID that is defined as an integer and B_Name_Full that is varchar(100) DEFAULT NULL COMMENT &#8216;lookup&#8217;.</p>

<p>Here are the final results:</p>

<p>SELECT B_Name_ID,&nbsp;  count(1) from T2 &nbsp;   -&nbsp; 40 sec.<br />
SELECT B_Name,&nbsp;  &nbsp;  &nbsp; count(1) from T2 &nbsp;   -&nbsp; 90 -100 sec.<br />
SELECT B_Name_Full,&nbsp; count(1) from T2 &nbsp;   -&nbsp; 130 sec.</p>

<p>I was expecting the same result from all queries - was the expectation incorrect?<br />
C_G
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Hadoop &amp;amp; Distributed Load Processor (DLP)</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/register/viewthread/3298/" />      
      <id>tag:infobright.org,2013:Forums/register/viewthread/.3298</id>
      <published>2013-03-22T15:14:10Z</published>
      <updated>2013-03-25T12:02:52Z</updated>
      <author><name>candeal01</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi,</p>

<p>I have Hadoop configuration (happens to be on Amazon but doesn&#8217;t have to) as follow:</p>

<p>Master1<br />
Master2<br />
Slave1<br />
Slave2<br />
Slave3</p>

<p>So my question is about the best practices using DLP:</p>

<p>1) Should I have DLP running on each slave (I presume there is the data) and if so<br />
2) Should each slave run only one DLP?&nbsp; </p>

<p>Generally, my question is about best architectural design of Hadoop/Infobright environment.<br />
Thanks</p>

<p>C_G
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Optimal Data types</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/register/viewthread/3388/" />      
      <id>tag:infobright.org,2013:Forums/register/viewthread/.3388</id>
      <published>2013-05-09T09:54:37Z</published>
      <updated></updated>
      <author><name>casatedu</name></author>
      <content type="html">
      <![CDATA[
        <p>HI,</p>

<p>I was wondering if there is any performance difference between:<br />
TINYINT, SMALLINT, MEDIUMINT, INT, BIGINT</p>

<p>Is there any benefit using, for instance, TINYINT as oppose to  INT or BIGINT<br />
C_G
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Infobright and Tableau</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/register/viewthread/3404/" />      
      <id>tag:infobright.org,2013:Forums/register/viewthread/.3404</id>
      <published>2013-05-23T12:24:42Z</published>
      <updated></updated>
      <author><name>casatedu</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi guys,</p>

<p>I know that this questions should be posted on the Tableau forum (and I will do that shortly), but I was wondering if anybody had some similar experience.<br />
I query IEE trial version (4.07) with Tableau and Toad.<br />
The difference in result is quite significant.<br />
I have a single table with aprox 250 Millions records.<br />
A simple select count(*) from T took 2min when direct query against Infobright took  lest than millisecond.</p>

<p>The similar issue is with other queries.<br />
Does anybody experience this?<br />
G_C
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Infobright Load</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/register/viewthread/3408/" />      
      <id>tag:infobright.org,2013:Forums/register/viewthread/.3408</id>
      <published>2013-05-24T11:30:16Z</published>
      <updated></updated>
      <author><name>casatedu</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi there,</p>

<p>I’m loading data to IEE and wanted to verify one fact.<br />
My understanding is that if I run load with AUTOCOMMIT=0 than at the end of the load only session that was loading can see new data and any other session do not until I issue COMMIT.<br />
Furthermore, I can run all sort of verifying queries against new data as long as I run them in the same session/connection that loaded data. At that point I can commit or rollback, based upon my test queries.</p>

<p>Is this correct?<br />
Thanks</p>

<p>C_G
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Cluster on Load</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/register/viewthread/3376/" />      
      <id>tag:infobright.org,2013:Forums/register/viewthread/.3376</id>
      <published>2013-05-08T10:07:36Z</published>
      <updated></updated>
      <author><name>casatedu</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi Guys,</p>

<p>The recent patent on Cluster on Load is very excited news.<br />
I was wondering if it is already available in the current IEE eval copy.<br />
Thx<br />
C_G
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Supported platform</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/register/viewthread/3356/" />      
      <id>tag:infobright.org,2013:Forums/register/viewthread/.3356</id>
      <published>2013-04-29T09:35:28Z</published>
      <updated></updated>
      <author><name>candeal01</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi,</p>

<p>I Have choice of running Proof of Concept with IEE on Windows or Ubumtu 13.04.<br />
If I&#8217;m correct the Ubuntu is not supported.<br />
What do I risk trying to run anyway? Is it worth of trying or should I just stick with Windows?<br />
Thanks<br />
C_G
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>ICE Max Memory for Linux x86_64</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/register/viewthread/3304/" />      
      <id>tag:infobright.org,2013:Forums/register/viewthread/.3304</id>
      <published>2013-04-03T16:33:57Z</published>
      <updated></updated>
      <author><name>speedyk1</name></author>
      <content type="html">
      <![CDATA[
        <p>HI.&nbsp; Expert help needed.</p>

<p>I can not find any documentation on this.</p>

<p>What is the max memory that ICE could use on a Linux Redhat x86_64?</p>

<p>Thank you in advance.</p>

<p>Regards,<br />
speedyk1
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Using Infobright on Amazon EC2</title>
      <link rel="alternate" type="text/html" href="http://www.infobright.org/Forums/register/viewthread/2628/" />      
      <id>tag:infobright.org,2011:Forums/register/viewthread/.2628</id>
      <published>2011-11-08T23:55:12Z</published>
      <updated>2011-11-09T13:38:02Z</updated>
      <author><name>Jeff Kibler</name></author>
      <content type="html">
      <![CDATA[
        <p>In a recent blog post, we discussed loading data on Infobright via Amazon EC2 instances using DLP.&nbsp; If you have any questions regarding that thread or using Infobright on EC2, please post them here.</p>

<p>The original blog post is here: <a href="http://bit.ly/ib_blog_ec2_dlp">http://bit.ly/ib_blog_ec2_dlp</a>
</p>
      ]]>
      </content>
    </entry>


</feed>