Is there some kind of tutorial I could follow to develop my test warehouse?
I have such scenario:
- I have a large amount of logs; I need to grep these logs in order to retrieve a piece of knowledge on a regular basis.
- the idea is to load all the data into IB database and build a simple web interface to query the database; this way I could take the burden of knowledge retrieval off me (other people could use the interface after a short tutorial)
- an additional surplus would be the size; IB can compress better than gzip.
The questions:
- how should I organize my data? Every single record contains a timestamp, 3 strings (always present), and a line of comment of variable length; I need to store 400 million records. A record is approx. 240B wide
- should I use a single table to store all my rows or should I split the data across several tables?
- how does IB behave when there are several (5-10) queries at once to the same database/table?
- what will be the response time of a query of an idle DB, or a busy DB (running 5 other queries)
- what hardware do I need to run my system smoothly?
- how do I import the data?
- how do I delete the oldest data?
To say the truth I’ve been experimenting with IB for some time, but the results are not satisfactory.

