<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
    xmlns:admin="http://webns.net/mvcb/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:content="http://purl.org/rss/1.0/modules/content/">

    <channel>
    
    <title>InfoBright Featured Blog Posts</title>
    <link>http://www.infobright.org/Open-source/Blog/Infobright_blog</link>
    <description></description>
    <dc:language>en</dc:language>
    <dc:creator>jeff.kibler@infobright.com</dc:creator>
    <dc:rights>Copyright 2013</dc:rights>
    <dc:date>2013-06-12T16:10:35+00:00</dc:date>
    <admin:generatorAgent rdf:resource="http://expressionengine.com/" />
    

    <item>
      <title>Best Practice: Using a Lookup Column in Infobright</title>
      <link>http://www.infobright.org/Blog/Entry/best_practice_using_a_lookup_column_in_infobright/</link>
      <guid>http://www.infobright.org/Blog/Entry/best_practice_using_a_lookup_column_in_infobright/#When:17:10:35Z</guid>
      <description><![CDATA[<p>Having wide, fat columns typically outperform star schemas.&nbsp; When condensing star schemas, one may experience performance drops with varchar/char fields for those &lsquo;dimension&rsquo; columns in this wide, fat table.&nbsp; Infobright&rsquo;s &lsquo;lookup&rsquo; functionality can assist you in optimizing performance.&nbsp; By hashing UNIQUE values of the lookup column in memory, we can achieve higher compression as well as faster query performance.&nbsp; We see these results as we are actually using an integer underneath the hood.&nbsp; In the past, we had stricter limitations surrounding lookups; starting in Infobright 4.x, those limitations have been loosened.<br /><br />While limitations may be loosened, one should also consider the ramifications of using 'lookup' columns. As the lookup is stored uncompressed in memory, you're limited by the amount of resources on the system. If you have a very large table with a very large number of distinct values, you may consume significant RAM resources with this lookup. Therefore I recommend you only utilize large lookups on columns which are critical and beneficial to you. Do not 'lookup' a table just because you can; consider how often you use that column and compare with resource consumption.<br /><br />In short, use lookups when you can maintain low cardinality (&gt;= 10:1 ratio of total-to-distinct values). When the total number of distinct values is extremely large, justify the use of RAM before setting the lookup flag.<br />To wrap up:<br />&nbsp; &bull; Have a &gt;= 10:1 ratio of total-to-distinct<br />&nbsp; &bull; Ensure you have enough RAM to hold all uncompressed, distinct values in memory (without causing other processes/queries to suffer)<br />&nbsp; &bull; Lookups are only applicable to varchar/char fields. Numbers and Dates will be ignored.<br />&nbsp; &bull; Only consider lookups for commonly used columns in the select, where, and group-by clauses of queries. Rarely used columns only suck up RAM usage.<br />&nbsp; &bull; Don't use lookups as a general 'surrogate key'; only use lookup when you need to use it.<br /><br />Other things to consider:<br />&nbsp; &bull; Initial Server Startup Time can be impacted if you have an extremely large number of lookup columns. It's pulling those values off disk and putting them in RAM when you start the service.<br />&nbsp; &bull; You're automatically taking RAM away from other processes/queries when using Lookups<br />&nbsp; &bull; You cannot change the DDL to remove or add new lookups. It requires a full data dump, drop table, create table, and re-load in order to add/remove lookup columns. In the future, we hope to change where 'lookups' are defined, but at least for now, it's a risk.<br />&nbsp; &bull; DomainExpert&trade; technology (beginning in 4.0) is a great alternative for any column which doesn't fit the lookup paradigm *and* has a repeatable pattern (ex: e-mail addresses).<br />&nbsp; &bull; If DomainExpert and Lookups do not qualify, adding an md5-hash-equivalent column can help with query times on char/varchar columns. More information on MD5 hashing can be found here:&nbsp;<a href="http://www.infobright.org/images/uploads/blogs/how-to/How_To_Efficiently_Search_Strings_in_Infobright.pdf">http://www.infobright.org/images/uploads/blogs/how-to/How_To_Efficiently_Search_Strings_in_Infobright.pdf</a></p>]]></description>
      <dc:subject></dc:subject>
      <dc:date>2013-06-12T17:10:35+00:00</dc:date>
    </item>

    <item>
      <title>Common steps for debugging connection issue</title>
      <link>http://www.infobright.org/Blog/Entry/common_steps_for_debugging_connection_issue/</link>
      <guid>http://www.infobright.org/Blog/Entry/common_steps_for_debugging_connection_issue/#When:20:19:55Z</guid>
      <description><![CDATA[<p>It has been a little while since I have had the time to sit and write again and I can admit that I quite miss it.  From a community standpoint, myself and the intern team have been really busy with getting the new websites in place.  We also have alot plans for some really awesome projects being planned with some other open source companies like Akiban, as well as some Social Media contests.  Recently though, I have been handling a fair amount of requests dealing with common issues, ranging from simple connections to permission issues.    As a general rule, approaching these issues, there is a common few steps that are taken to help eliminate where the issue lies.</p>
<p>On Unix/Linux:</p>
<ol>
<li>From prompt type  'sudo su'</li>
<li>Enter your password if prompted</li>
<li>type    /etc/init.d/mysqld-ib start</li>
<li>If a process is already running, it will let you know </li>
<li>type     mysql-ib -u{username}</li>
<li>Once the mysql prompt is up, start at Step 4 below.</li>
</ol>
<p>On windows:</p>
<ol>
<li>Open a command prompt by right clicking and use 'Run as Administrator'</li>
<li>type  'sc start infobright' (no apostrophes) to determine if infobright is running.  If Infobright is running, you will get an error indicating that.</li>
<li>In explorer, navigate to the Infobright installation and double click  Infobright-client.bat , which may also be in your Start &gt;&gt; Programs &gt;&gt; Infobright</li>
<li>At the mysql prompt type "SHOW GRANTS FOR CURRENT_USER;" <br /><br /> mysql&gt; show grants for current_user; <br /> +---------------------------------------------------------------------+ <br /> | Grants for root@localhost                                           | <br /> +---------------------------------------------------------------------+ <br /> | GRANT ALL PRIVILEGES ON *.* TO 'root'@'localhost' WITH GRANT OPTION | <br /> +---------------------------------------------------------------------+ <br /> 1 row in set (0.00 sec) <br /> <br /> mysql&gt; </li>
<li>IF your user can ONLY access the Infobright instance from certain locations (i.e. localhost, 127.0.0.1 ), it will show that here. <br />&nbsp;<br /> mysql&gt; show grants for current_user; <br /> +---------------------------------------------------------------------+ <br /> | Grants for root@localhost                                           | <br /> +---------------------------------------------------------------------+ <br /> | GRANT ALL PRIVILEGES ON *.* TO 'root'@'%' WITH GRANT OPTION | <br /> +---------------------------------------------------------------------+ <br /> 1 row in set (0.00 sec) <br /> <br /> mysql&gt; </li>
<li>IF it shows a percentage sign, then your permissions are for any location to the instance. Notice the difference in localhost and %</li>
<li>to add remote access type :<br /> mysql&gt; GRANT ALL ON *.* TO 'root'@'%';</li>
<li>Now if you are unable to get in, there must be a firewall on this machine blocking the requests in.</li>
</ol>
<p>This is a very common issue because a default installation of Infobright only allows the root user access to the instance from the machine itself. Granting the remote access is very common usage, though not suggested for the root user becuase it is not good practice to allow root access except from the machine itself, as a security protocol.  Stay tuned next week for some more exciting news concerning the Infobright Community....</p>]]></description>
      <dc:subject></dc:subject>
      <dc:date>2012-11-08T20:19:55+00:00</dc:date>
    </item>

    <item>
      <title>Big Data Infographc</title>
      <link>http://www.infobright.org/Blog/Entry/big_data_infographc/</link>
      <guid>http://www.infobright.org/Blog/Entry/big_data_infographc/#When:15:09:05Z</guid>
      <description><![CDATA[<p>An infographic is a good way to explain something in a very visual way. As many of you know, we do a number of things to help people make sense of the crazy database world. Here is an infographic on Big Data that helps explain row vs column vs NoSQL. Be sure to click the right arrows to see more info.<a href="http://www.infobright.com/tmp270812/"> </a><a href="http://www.infobright.com/tmp270812/">Take a look! </a></p>]]></description>
      <dc:subject></dc:subject>
      <dc:date>2012-09-20T15:09:05+00:00</dc:date>
    </item>

    <item>
      <title>MongoDB and Infobright</title>
      <link>http://www.infobright.org/Blog/Entry/mongodb_and_infobright/</link>
      <guid>http://www.infobright.org/Blog/Entry/mongodb_and_infobright/#When:15:05:47Z</guid>
      <description><![CDATA[<p>if you have been following Infobright for any period of time, you know that we are big proponents of using the right database for the right use case. That's why we spend time and effort helping educate our prospects and community members about different technologies and which use cases they are best suited for. Recently Jeff Kibler, our SE extraordinaire and former community manager, downloaded and tested MongoDB and has written a new white paper about using MongoDB with Infobright. You can download it from our white paper page: <a href="http://support.infobright.com/Support/Resource-Library/Whitepapers/">http://support.infobright.com/Support/Resource-Library/Whitepapers/</a> (scroll down a bit to the Infobright Technical White Papers and you'll see it).</p>
<p>If you haven't already read the white paper "The Emerging Database Landscape" which discusses row, columnar, and NoSQL databases you can download it here:<strong> </strong><a href="http://www.infobright.com/land/emerging_database_landscape/">http://www.infobright.com/land/emerging_database_landscape/</a></p>]]></description>
      <dc:subject></dc:subject>
      <dc:date>2012-09-05T15:05:47+00:00</dc:date>
    </item>

    <item>
      <title>OSCON 2012 &#45; Portland</title>
      <link>http://www.infobright.org/Blog/Entry/oscon_2012_-_portland/</link>
      <guid>http://www.infobright.org/Blog/Entry/oscon_2012_-_portland/#When:15:43:02Z</guid>
      <description><![CDATA[<p>I just recently returned from the <a href="http://www.oscon.com/oscon2012">O'Reilly Open Source Conference</a> in Portland, where Infobright was exhibiting.  For those of you who have not had a chance to visit Portland, it is a great location for this conference.  As you walk into the Convention Center, you do not realize how big the property is until you actually find the hall where the conference is.  There are these huge sprawling hallways that tend to open up and just when you thought you found the end, there is more.  Walking into the hall where the exhibitors were located, there seemed to be a real sense of excitement from everyone involved, not the sense of "this is something we have to do", but more like something everyone was looking forward to. This seemed to be true for the attendees as well.</p>
<p>The exhibitor list ranged from all of the big companies that I expected to see, such as Oracle, Intel, O'Reilly, to quite a few of the up and comers in open source, like Akiban.  One of the bigger themes seemed to be the Cloud, and it was clearly attracting a lot of interest.  What I found to be most interesting was that the general intent of the attendees seemed to be education.  A majority of the attendees seemed to exhibit a real interest in learning about new technologies, which for me meant the need to figure out "How do I earn their attention?".  If you are reading this, chances are you already know who Infobright is and what we do, which we excel at.  But to continue to grow a strong community, we have to continue to push and expand our base.  This was the approach I took and it worked well.</p>
<p>The majority of the attendees I spoke with were unaware of Infobright and how we play a major role in the big data industry.  I was able to attract many of them by discussing the core of our technology and how that can be used to solve some of the issues they face within their own businesses.</p>
<p>The conference itself was a hodge-podge of various technologies, which helped to keep attendees on their toes between many of the different speaker sessions that were available.  They were eager to learn about new technologies, but it was a constant reminder to me that even though the big data industry is large, the general knowledge and understanding of it is mostly known to those people within the industry.  There are numerous engineers and architects who were already familiar with Infobright but the vast majority were not.  One of the individuals I spoke with who was already familiar with us said, "Infobright, yeah, tested it, liked it, performed great."</p>
<p>The weekly <a href="http://www.infobright.com/Event/infobright_technical_webinar_series_june_11/">technical webinars</a> Infobright puts on align with the areas of interest I saw at OSCON&ndash; helping educate people on the vast array of emerging technologies for dealing with big data. Education was why so many people came to OSCON. Don't make that a once a year proposition, continue to seek out new sources of learning.</p>
<p>So, for Infobright, the conference was a success and left a good impression on me for the next one.  Oh, but you should definitely check out Intel's new Open Source technology Center at <a href="https://www.01.org/">01.org</a>.</p>]]></description>
      <dc:subject></dc:subject>
      <dc:date>2012-07-27T15:43:02+00:00</dc:date>
    </item>

    <item>
      <title>Big Data Who, Big Data What, Big Buzz, Big ?</title>
      <link>http://www.infobright.org/Blog/Entry/big_data_who_big_data_what_big_buzz_big/</link>
      <guid>http://www.infobright.org/Blog/Entry/big_data_who_big_data_what_big_buzz_big/#When:22:04:59Z</guid>
      <description><![CDATA[<p>I was recently asked this question, "How do you define big data?"&nbsp; My response was "I say big data defines you."</p>
<p>With that in mind, let me expand on my definition.&nbsp; To do this, I would like to digress and say that big data in definition is exactly that; it is a big amount of data. (Current industry discussions add Velocity and Variety, and sometimes Value, meaning how fast it needs to be analyzed to provide value). Data is by definition information in some format or structure, so "big" must be analogous for amount.&nbsp; Yet my definition of big data goes a bit past this.&nbsp; The big data industries that have formed over the last decade focus on many different aspects of it; some deal with the infrastructure to house and manage, some deal with the software, some are in the analytic and business intelligence world, some are the consultants, and numerous companies focus on marketing, education and conferences on the subject. Within each segment of the industry, definitions may vary a little, but there are still some very key attributes that remain the same.</p>
<p>One, there are vast amounts of data which present unique challenges.&nbsp; Two, above and beyond the tasks of storing, managing and analyzing, working with this data is a primary goal (otherwise, why store it.)&nbsp; Three, we look for insights that we can extract from this information to enable us to make decisions that will define the way we act upon the information. For instance, a police chief in Pennsylvania knew that by analyzing criminal behavior and patterns within them, he would be able to reduce crime in a geographic area by implementing a higher police presence at specific times.&nbsp; The data and information that he extracted defined his response. Whether someone is researching a large number of publications and extracting relevant topics on specific articles, or they are gathering sales information for yearly trends during the rise and fall of economic windfalls, the process is still the same: we store, we search, we analyze, we define.</p>
<p>As a programmer myself, I have dealt with the various aspects of the infrastructure surrounding data.&nbsp; It is the service- and data-oriented interfaces that provide the vehicle to deliver information in a meaningful way.&nbsp; I have always focused on ensuring that the end user was able to get to the data that they were looking for, otherwise the data is almost useless.&nbsp; Data's reason for existing is solely the value of that information and what can be extracted from it.&nbsp; One of the most exciting aspects of this industry for me was reading about the different use cases for our analytic database and the very unique approaches that different organizations took to solving complex problems in simple elegant steps.&nbsp; Browse to <a href="http://www.infobright.com">http://www.infobright.com</a> and read about how the <a href="http://www.infobright.com/Customer/canadian_space_agency">Canadian Space Agency</a> is using Infobright to store and read their machine-generated data, or any one of the <a href="http://www.infobright.com/Customer/jdsu">JDSU</a> or telecom-related <a href="http://support.infobright.com/Support/Resource-Library/Whitepapers/">whitepapers</a>.<br />&nbsp;<br />At the end of the day, we define our actions by what we learn.</p>]]></description>
      <dc:subject></dc:subject>
      <dc:date>2012-07-17T22:04:59+00:00</dc:date>
    </item>

    <item>
      <title>Become a Hero (or Heroine) to Marketing..Web Analytics Made Easy</title>
      <link>http://www.infobright.org/Blog/Entry/become_a_hero_or_heroine_to_marketing..web_analytics_made_easy/</link>
      <guid>http://www.infobright.org/Blog/Entry/become_a_hero_or_heroine_to_marketing..web_analytics_made_easy/#When:20:23:38Z</guid>
      <description><![CDATA[<p>Last week Infobright, Pentaho and Semphonic <a href="http://www.infobright.com/land/infobright_pentaho_semphonic_big_data_web_analytics">announced a joint initiative</a> to help organizations derive business advantage from their detailed website data. For companies who depend on their websites to drive revenue (whether that be lead generation, ecommerce, or other activities) our goal is twofold:</p>
<ul>
<li>Help marketing teams understand what data is important and what measurements can deliver actionable information</li>
<li>Help IT to quickly and easily extract relevant data from their Web analytics tools and implement an interactive dashboard and reporting system that marketing analysts can get value from</li>
</ul>
<p>As Web analytics has been around almost as long as the Web itself, you may be wondering why you need something more than you probably have today. To paraphrase from the white paper, <a href="http://www.infobright.com/land/digitalmeasurement_bestpractices.html">&ldquo;Everything You Know about Digital Measurement is Wrong..and How to Get it Right&rdquo; </a>, the problem is that all of the traditional Web analytics metrics are meaningless in terms of marketing and targeting. &nbsp;Segmentation in the digital age is more complex than with brick-and-mortar stores, and it requires both a new way of modeling the data and a new way to present it. It needs to answer not just Who? but Why?</p>
<p>The work that Infobright, Pentaho and Semphonic have done is a great start towards making it easy to implement a system that provides the answers your marketing teams need. To learn more, <a href="http://www.infobright.com/land/digitalmeasurement_bestpractices.html">download</a> the white paper, <a href="https://infobrightevents.webex.com/mw0307l/mywebex/default.do?nomenu=true&amp;siteurl=infobrightevents&amp;service=6&amp;rnd=0.942246962487935&amp;main_url=https%3A%2F%2Finfobrightevents.webex.com%2Fec0606l%2Feventcenter%2Fevent%2FeventAction.do%3FtheAction%3Ddetail%26confViewID%3D1002770644%26%26%26%26siteurl%3Dinfobrightevents">register</a> for this the July 19 webinar, or <a href="http://webanalytics.infobright.com/">try out the demo </a>for yourself. Become an IT star to your marketing team!</p>
<p>We&rsquo;d love any feedback and comments you have. Just sent us a note at <a href="mailto:info@infobright.com">info@infobright.com</a></p>]]></description>
      <dc:subject></dc:subject>
      <dc:date>2012-07-16T20:23:38+00:00</dc:date>
    </item>

    
    </channel>
</rss>