Log in

Generating large example data with TPC-H

Several times I had the need for some large data sets to do some Data Vault tests at customer site, writing a blogpost, doing a demo or a webinar and many more. And sometimes I need data to do performance or data usage tests on different databases. Due to my work together with EXASOL I focused on the TPC-H tool DBGen to generate gigabytes of data.

To share my experience with DBGen generating large data sets I wrote this blogpost as a step by step instruction.

  • Geschrieben von Dirk Lerner
  • Zugriffe: 11784

Look Back Over TDWI 2016

Last week in June I was at the TDWI Conference 2016 at Munich. ITGAIN, my employer, had as a platin sponsor a booth to present our products and services!

In my point of view, it was another great TDWI conference at Munich with a lot of awesome people I could talk with - including an interesting discussion about data architecture with Mark (Madsen) and all the nonsense happening in the Big Data world.

  • Geschrieben von Dirk Lerner
  • Zugriffe: 2086

Meetup – Data Vault Interest Group

I reactivated my Meetup Data Vault Interest Group this week. Long time ago I was thinking about a table of fellow regulars to network with other, let’s call them Data Vaulters. It should be a relaxed get-together, no business driven presentation or even worse advertisement for XYZ tool, consulting or any flavor of Data Vault. The feedback of many people was that they want something different to the existing Business Intelligence Meetings. So, here it is!

  • Geschrieben von Dirk Lerner
  • Zugriffe: 3715

High performance - Data Vault and Exasol

You may have received an e-mail invitation from EXASOL or from ITGAIN inviting you to our forthcoming webinar, such as this:

Do you have difficulty incorporating different data sources into your current database? Would you like an agile development environment? Or perhaps you are using Data Vault for data modeling and are facing performance issues?
If so, then attend our free webinar entitled “Data Vault Modeling with EXASOL: High performance and agile data warehousing.” The 60-minute webinar takes place on July 15 from 10:00 to 11:00 am CEST.
  • Geschrieben von Dirk Lerner
  • Zugriffe: 5460

How to load easy some data vault test data

Some time ago a customers asked me how to load easy and simple some (test)data into their database XYZ (chose the one of your choice and replace XYZ) to test their new developed Data Vault logistic processes.
The point was: They don’t want to use all this ETL-tool and IT-processes overhead just to some small test in their own environment. If this this is well done from a data governance perspective? Well, that’s not part of this blogpost. Just do this kind of thingis only in your development environment.

  • Geschrieben von Dirk Lerner
  • Zugriffe: 4039
  • Data Vault
  • Agile EDW
  • Conferences
  • Trainings
  • Bücher
  • Other
  • TDWI Roundtable

    Blogposts around the TDWI Roundtable Frankfurt.

  • Data Architecture
  • FOM

    Fact-Oriented Modeling (FOM) stands for a family of fact-oriented conceptual modeling methods. FOM facilitates easier communication about the conceptual model between the modeler and the domain expert by verbalization of concrete examples in the language of the domain expert, a design process as a guide for creating the model and the focus on elementary facts. The most popular methods in this family are Cognition Enhanced Natural Language Information Analysis Method (CogNIAM), Second Generation Object Role Modeling (ORM 2) and Fully Communication Oriented Information Modeling (FCO-IM).

  • General Modeling
  • Bitemporal Data

    If everything would happen at the same time, there would be no need to store historic data. We, the consumers of data, would know each and everything at the same instant. Beside all the other philosophical impacts, if time wouldn’t exists, is data still necessary?

    (Un)fortunately time exists and data architects, data modelers and developers have to deal with it in the world of information technology.

    In this category about temporal data I will collect all my blogposts about this fancy topic.

  • Data Modeling Tools
  • Data Modeling Certification