<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>BCsolution.com</title>
	<atom:link href="http://www.bcsolution.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.bcsolution.com</link>
	<description>Dashboards to Data Warehouses</description>
	<lastBuildDate>Thu, 17 Dec 2009 22:17:28 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>SQL Server DWH Opportunity</title>
		<link>http://www.bcsolution.com/2009/12/sql-server-dwh-opportunity/</link>
		<comments>http://www.bcsolution.com/2009/12/sql-server-dwh-opportunity/#comments</comments>
		<pubDate>Thu, 17 Dec 2009 22:15:54 +0000</pubDate>
		<dc:creator>cwardell</dc:creator>
				<category><![CDATA[Latest News]]></category>

		<guid isPermaLink="false">http://www.bcsolution.com/?p=571</guid>
		<description><![CDATA[SQL Server DWH Opportunity - The Analyst is responsible for analyzing business procedures and problems to define data, data structures and algorithms for data saved to and extracted from the systems database.]]></description>
			<content:encoded><![CDATA[<p><strong>Summary: </strong><br />
The Analyst is responsible for analyzing business procedures and problems to define data, data structures and algorithms for data saved to and extracted from the systems database.</p>
<p><strong>Essential Duties:</strong></p>
<ul>
<li>Analyzes, designs and implements accurate, secure, efficient, logical and physical databases and coordinates database development applying knowledge of database design standards and database management system.</li>
<li>Identifies and resolves production and/or applications development problems related to the use of the</li>
<li>Database management system and end-user application software or utilities.</li>
<li>Reviews project request describing database user needs.</li>
<li>Studies existing data handling systems to evaluate effectiveness and develops new systems to improve production or workflow as required.</li>
<li>Specifies in detail logical and/or mathematical operations to be performed by various equipment units and/or comprehensive computer programs, and operations to be performed by personnel in system.</li>
<li>Estimates time and cost required to accomplish project.</li>
<li>Determines if project requires creating series of new programs or modifying existing programs that access data stored in databases.</li>
<li>Attends specifications meeting with project team workers to determine scope and limitations of project.</li>
<li>Reviews procedures in database management system manuals for making changes to database, such as defining, creating, revising, and controlling database.</li>
<li>Revises company definition of data as defined in data dictionary (information about data, including name, description, source of data item, and key words for categorizing and searching for data item descriptions).</li>
<li>Determines and enters changes to data dictionary descriptions, including type, structure, and intended use of data within system, or assigns data entry work to programmer.</li>
</ul>
<p><strong>Job Requirements:</strong></p>
<ul>
<li>Strong inter-personal and communication skills; capable of mentoring staff, acting as a vendor liaison, making presentations/leading meetings, and working closely with management.</li>
<li>Ability to solve problems quickly and completely.</li>
<li>Ability to identify tasks which require automation and automate them.</li>
<li>A solid understanding of computer technologies and the direction of new technologies.</li>
<li>Ability to plan and organize effectively.</li>
<li>A solid understanding of the reinsurance and how technology can be applied to improve the business process.</li>
<li>Maintains confidentiality of company-sensitive data.</li>
<li>Keeps the Director informed of activities and situations, which will impact the achievement of corporate, and department goals and objectives.</li>
<li>Performs other related duties as required to achieve the goals and objectives of the company and department.</li>
</ul>
<p><strong>Technical/Experience Requirements: </strong></p>
<ul>
<li>Must have strong development life-cycle experience</li>
<li>Must have SQL development experience with large databases</li>
<li>Must have experience with a high level programming language in the development and delivery of complex software projects.</li>
<li>Must have the ability to communicate clearly, concisely to non technical audiences</li>
</ul>
<p><strong>Supervisory Responsibilities:</strong> This position may have Supervisory Responsibilities</p>
<p><strong>Contact:</strong> <a title="Jen Perri" href="mailto:jen@mriunion.com">Jen Perri</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.bcsolution.com/2009/12/sql-server-dwh-opportunity/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Word on the Street (Teradata, Netezza, Vertica, ParAccel, Greenplum)</title>
		<link>http://www.bcsolution.com/2009/10/database-appliance-buzz/</link>
		<comments>http://www.bcsolution.com/2009/10/database-appliance-buzz/#comments</comments>
		<pubDate>Fri, 02 Oct 2009 05:14:41 +0000</pubDate>
		<dc:creator>cwardell</dc:creator>
				<category><![CDATA[Latest News]]></category>
		<category><![CDATA[MPP]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Paraccel]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[Vertica]]></category>

		<guid isPermaLink="false">http://www.bcsolution.com/?p=502</guid>
		<description><![CDATA[I started investigating Google Insight to see what the search volumes are with regard to a few of the competing database technologies. My interest is primarily in Teradata, Netezza, Vertica, ParAccel, and Greenplum. Since text mining ]]></description>
			<content:encoded><![CDATA[<p>As I venture into my new skunk works &#8220;developing a sentiment based analysis platform&#8221;, I started investigating Google Insight to see what the search volumes are with regard to a few of the competing database technologies. My interest is <span id="more-502"></span>primarily in  Teradata, Netezza, Vertica, ParAccel, and Greenplum. Since text mining and sentiment analysis are part of my product road-map, I thought investigating search volumes would be an interesting metric when placed side by side with sentiment. Although I have not performed a sentiment analysis on the blogosphere or news feeds for this topic, I believe that search metrics can provide some pretty interesting insight.</p>
<p>All three charts span the last 12 months. Teradata is left off of the second chart so I can have finer granularity when comparing Netezza, Vertica, ParAccel, and Greenplum. Likewise, Netezza is left off of the third chart so I can have more granularity on Vertica, ParAccel and Greenplum.</p>
<p>It is important to note that according to Google: &#8220;The numbers on the graph reflect how many searches have been done for a particular term, relative to the total number of searches done on Google over time. They don&#8217;t represent absolute search volume numbers, because the data is normalized and presented on a scale from 0-100. Each point on the graph is divided by the highest point, or 100. When we don&#8217;t have enough data, 0 is shown. The numbers next to the search terms above the graph are summaries, or totals.&#8221;</p>
</p>
<h2>Including Teradata</h2>
<p><code><script src="http://www.gmodules.com/ig/ifr?url=http%3A%2F%2Fwww.google.com%2Fig%2Fmodules%2Fgoogle_insightsforsearch_interestovertime_searchterms.xml&amp;up__property=empty&amp;up__search_terms=Vertica%7CGreenplum%7CParAccel%7CNetezza%7CTeradata&amp;up__location=US&amp;up__category=0&amp;up__time_range=12-m&amp;up__compare_to_category=false&amp;synd=ig&amp;w=450&amp;h=350&amp;lang=en-US&amp;title=Google+Insights+for+Search&amp;border=%23ffffff%7C3px%2C1px+solid+%23999999&amp;output=js" type="text/javascript"></script> </code></p>
<p>Needless to say, I was quite surprised to see the amount of searches regarding Teradata.  It is interesting to note that the dip in October 08 followed by the sharp decline in January 09 follows an October 13th wire on &#8220;<a title="Teradata Breaks the Affordability Barrier to Analyzing Petabyte-Sized Data Volumes" href="http://www.reuters.com/article/pressRelease/idUS200368+13-Oct-2008+BW20081013" target="_blank">Teradata Breaks the Affordability Barrier to Analyzing Petabyte-Sized Data Warehouses</a>&#8221; What I did notice is that the top searches for Teradata were technical in nature. IE: &#8220;Teradata SQL&#8221;, &#8220;Teradata date&#8221;, &#8220;Teradata Syntax&#8221; with searches rising on the topic of &#8220;Teradata Spool&#8221;, &#8220;Teradata Rank&#8221;, &#8220;Teradata SAP&#8221;.</p>
<p>Given the fact that the majority of the searches are technical in nature, it becomes obvious as to why the search volume was so high over the other products. Looking into search terms that are more exploratory or comparative are probably far more valuable from a marketing point of view. If I find some time,  I may do another post that would analyze more &#8220;inquisitive&#8221; terms likely submitted by potential customers.</p>
</p>
<h2>Omitting Teradata</h2>
<p><code> <script src="http://www.gmodules.com/ig/ifr?url=http%3A%2F%2Fwww.google.com%2Fig%2Fmodules%2Fgoogle_insightsforsearch_interestovertime_searchterms.xml&amp;up__property=empty&amp;up__search_terms=Vertica%7CGreenplum%7CParAccel%7CNetezza&amp;up__location=US&amp;up__category=0&amp;up__time_range=12-m&amp;up__compare_to_category=false&amp;synd=ig&amp;w=450&amp;h=350&amp;lang=en-US&amp;title=Google+Insights+for+Search&amp;border=%23ffffff%7C3px%2C1px+solid+%23999999&amp;output=js" type="text/javascript"></script></code></p>
<p>When omitting Teradata you get a better sense as to where Netezza is relative to the other contenders. It is clear that Netezza is holding on to their position and slightly widening the gap as of late. I do however believe that the upward trend is related to the launch of Twinfin. If I am right, I would have expected to see more volume of searches. What is interesting is that unlike Teradata, The top search terms for Netezza are &#8220;Netezza Database&#8221;, &#8220;Netezza Teradata&#8221;, &#8220;Netezza Corporation&#8221;, &#8220;Netezza data warehouse&#8221; This is excellent news for Netezza as there seems to be an obvious interest and comparison between Netezza and Teradata. I believe my hunch is right regarding one of my earlier posts &#8220;<a title="Netezza Paved the Way" href="http://www.bcsolution.com/2009/05/netezza-paved-the-way/" target="_blank">Netezza Paved the Way</a>&#8220;.</p>
<p>Netezza is well positioned against Teradata. Netezza has a large opportunity here because unless Teradata is being purchase with a whole host of other applications and logical data models, Netezza should be winning the analytics play. Switching from Teradata to Netezza is a hard sell even when the Teradata platform is fully amortized. A large portion of Teradata&#8217;s revenue is related to professional services but the nature of the search terms lead me to believe that Teradata is expanding existing environments and may not be seeing new client acquisition (for analytics that is). This is purely a guess but can easily be discovered from their filings. Perhaps Netezza should work on a migration strategy to chip away at the existing Teradata stronghold but definately attempt to take them head on for new client aquisition that fall into the extreme analytics category.</p>
</p>
<h2>Omitting Teradata and Netezza</h2>
<p><code><script src="http://www.gmodules.com/ig/ifr?url=http%3A%2F%2Fwww.google.com%2Fig%2Fmodules%2Fgoogle_insightsforsearch_interestovertime_searchterms.xml&amp;up__property=empty&amp;up__search_terms=Vertica%7CGreenplum%7CParAccel&amp;up__location=US&amp;up__category=0&amp;up__time_range=12-m&amp;up__compare_to_category=false&amp;synd=ig&amp;w=450&amp;h=350&amp;lang=en-US&amp;title=Google+Insights+for+Search&amp;border=%23ffffff%7C3px%2C1px+solid+%23999999&amp;output=js" type="text/javascript"></script></code></p>
<p>The pattern of Greenplum and Vertica trend-lines are so close in nature that it appears they are going head to head on most deals. Vertica does edge out Greenplum by a decent margin from an inquisitive search metric. The April 09 spike related to Greenplum seems to be related to the article on April 7th &#8220;<a title="GoldenGate and Greenplum" href="http://www.goldengate.com/news/pressreleases/pr_greenplum_040709.html" target="_blank">GoldenGate Software and Greenplum Join Forces to Enable High Performance, Real-Time Data Feeds to the Warehouse</a>&#8220;</p>
<p>Paraccel has a few good announcements lately  but it&#8217;s tough to be in 4th position. Paraccel&#8217;s recent climb seems to be related to the Price Choper deal. I also expect to see climb in search volume in the month of October related to the announcement &#8220;<a title="Price Chopper Selects" href="http://www.reuters.com/article/pressRelease/idUS103889+18-May-2009+BW20090518" target="_blank">Scales-Out &#8211; Merkle Scales-Out its ParAccel Consumer Data Warehouse Platform to 20 Terabytes</a>&#8220;. From a search perspective, it still trails behind Vertica and Greenplum quite a bit.</p>
<p><code><br />
<script type="text/javascript" src="http://www.gmodules.com/ig/ifr?url=http%3A%2F%2Fwww.google.com%2Fig%2Fmodules%2Fgoogle_insightsforsearch_interestovertime_searchterms.xml&amp;up__property=empty&amp;up__search_terms=vertica%7Cgreenplum%7C&amp;up__location=empty&amp;up__category=0&amp;up__time_range=12-m&amp;up__compare_to_category=false&amp;synd=ig&amp;w=450&amp;h=350&amp;lang=en-US&amp;title=Google+Insights+for+Search&amp;border=%23ffffff%7C3px%2C1px+solid+%23999999&amp;output=js"></script><br />
</code></p>
<p>Vertica has a steady upward climb going on related to search and all searches seem to be inquisitive in nature. IE: &#8220;Vertica Systems&#8221;, &#8220;Vertica Database&#8221;, &#8220;Sybase Vertica&#8221; Their recent release &#8220;<a title="Pink OTC" href="http://www.reuters.com/article/pressRelease/idUS125746+10-Sep-2009+PRN20090910" target="_blank">Pink OTC Markets Inc. Cuts Reporting Time, Hardware and Programming Costs with Data&#8230;</a>&#8221; is bringing the climb upward still. It is possible that another few deals would put Vertica in a position to break away into the Teradata, Netezza, Vertica trifecta.</p>
</p>
<h2>Summary</h2>
<p>My analysis needs to stop here for now. It would be nice to have the time to do a deep dive study on the technologies from a search, attitudinal, perception and sentiment point of view. This quick analysis was pretty interesting but it is just the tip of the iceberg and focuses on a small segment of data. This year there will be about 24 terabytes of social network data and news feeds. Carving out the MPP Database section of the web would prove to be an interesting study.</p>
<p>So in general this quick study leads me to believe:</p>
<ul>
<li>Teradata&#8217;s Search volume is mostly technical in nature possibly indicating that new customers are hard to come by</li>
<li>Netezza is being compared against Teradata and the searches for Netezza are inquisitive or comparative</li>
<li>Vertica is being compared against Greenplum and Sybase and all searches seem to be inquisitive or comparative</li>
</ul>
</p>
<address>Regards,</address>
<address>Charlie</address></p>
]]></content:encoded>
			<wfw:commentRss>http://www.bcsolution.com/2009/10/database-appliance-buzz/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Investment Bank looking for Netezza Resource</title>
		<link>http://www.bcsolution.com/2009/09/investment-bank-netezza-resource/</link>
		<comments>http://www.bcsolution.com/2009/09/investment-bank-netezza-resource/#comments</comments>
		<pubDate>Wed, 02 Sep 2009 04:34:35 +0000</pubDate>
		<dc:creator>cwardell</dc:creator>
				<category><![CDATA[Latest News]]></category>
		<category><![CDATA[Job Opportunity]]></category>
		<category><![CDATA[Netezza]]></category>

		<guid isPermaLink="false">http://www.bcsolution.com/?p=496</guid>
		<description><![CDATA[Folks,
I just came in contact with a pretty good technical recruiter. He seems genuinely passionate about his career and asked me if I knew any hands-on Netezza resources. I thought I would put this out there in case you have an interest.
Our client, a New York City Investment Bank is looking for a hands on [...]]]></description>
			<content:encoded><![CDATA[<p>Folks,</p>
<p>I just came in contact with a pretty good technical recruiter. He seems genuinely passionate about his career and asked me if I knew any hands-on Netezza resources. I thought <span id="more-496"></span>I would put this out there in case you have an interest.</p>
<blockquote><p>Our client, a New York City Investment Bank is looking for a hands on technical expert to work on a critical high availability platform. Must have experience with the following: Netezza data warehouse (1 year), Informatica (2 Years), SQL, PL/SQL and Oracle. Platforms include Solaris, Linux and Windows. Must have worked on the full SDLC. Major pluses include Cognos 8.2 and Autosys. The location is mid town NYC and the salary is between $100,000-$120,000 based on experience. If you are interested, please shoot your resume to jbnewman@genesis10.com.</p>
<p>Jeffrey B. Newman<br />
Senior Technical Recruiter<br />
Genesis10 &#8211; Business and Technology Consultants<br />
212.688.5522 ext. 204<br />
www.genesis10.com<br />
http://www.linkedin.com/in/newmanlinkedin</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.bcsolution.com/2009/09/investment-bank-netezza-resource/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The V-Stick is Coming</title>
		<link>http://www.bcsolution.com/2009/07/vstick-coming/</link>
		<comments>http://www.bcsolution.com/2009/07/vstick-coming/#comments</comments>
		<pubDate>Mon, 13 Jul 2009 02:07:25 +0000</pubDate>
		<dc:creator>cwardell</dc:creator>
				<category><![CDATA[Latest News]]></category>
		<category><![CDATA[V-Stick]]></category>
		<category><![CDATA[LogiXML]]></category>
		<category><![CDATA[Vertica]]></category>

		<guid isPermaLink="false">http://www.bcsolution.com/?p=446</guid>
		<description><![CDATA[What do you get when you combine

A bootable USB Stick
A full blown Linux OS with GUI
Evaluation Version of the Vertica Analytics database Fully installed and configured
Apache/Tomcat Web Server
LogiXML Dashboard Reporting
Sample Data and Dashboards

Introducing the V-Stick
Powered by Vertica and LogiXML


We are currently putting the final touches on a sample dash-board and then will begin testing the [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: justify;">What do you get when you combine</p>
<ul style="text-align: justify;">
<li>A bootable USB Stick</li>
<li>A full blown Linux OS with GUI</li>
<li>Evaluation Version of the Vertica Analytics database Fully installed and configured</li>
<li>Apache/Tomcat Web Server</li>
<li>LogiXML Dashboard Reporting</li>
<li>Sample Data and Dashboards</li>
</ul>
<h1 style="text-align: center;"><span style="color: #000080;">Introducing the V-Stick</span></h1>
<h2 style="text-align: center;"><span style="color: #000080;">Powered by Vertica and LogiXML</span></h2>
<p style="text-align: center;">
<p style="text-align: center;"><a href="http://www.bcsolution.com/wp-content/uploads/2009/07/Vstick_Image1.jpg"><img class="alignnone size-medium wp-image-448" title="The VStick" src="http://www.bcsolution.com/wp-content/uploads/2009/07/Vstick_Image1-300x82.jpg" alt="The VStick" width="300" height="82" /></a></p>
<p style="text-align: justify;">We are currently putting the final touches on a sample dash-board and then will begin testing the Stick on different configurations. We have setup a <a title="V-Stick Support Forum" href="http://www.bcsolution.com/data-warehouse-community/the-vstick/" target="_self">forum</a> specifically to support the V-Stick, so feel free to <a title="V-Stick Support Forum" href="http://www.bcsolution.com/data-warehouse-community/the-vstick/" target="_self">post</a> your questions there.</p>
<address style="text-align: justify;">Best Regards,</address>
<address style="text-align: justify;">Charlie<br />
</address>
<p style="text-align: justify;">
<span class="sfforumlink"><a href="http://www.bcsolution.com/data-warehouse-community/the-vstick/the-v-stick-is-coming"><img src="http://www.bcsolution.com/wp-content/plugins/simple-forum/styles/icons/default/bloglink.png" alt="" /> Join the forum discussion on this post</a> - (1) Posts</span>]]></content:encoded>
			<wfw:commentRss>http://www.bcsolution.com/2009/07/vstick-coming/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Vertica on a Stick</title>
		<link>http://www.bcsolution.com/2009/06/vertica-thumb-drive/</link>
		<comments>http://www.bcsolution.com/2009/06/vertica-thumb-drive/#comments</comments>
		<pubDate>Fri, 12 Jun 2009 04:42:24 +0000</pubDate>
		<dc:creator>cwardell</dc:creator>
				<category><![CDATA[Innovation]]></category>
		<category><![CDATA[MPP]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[Data Warehousing]]></category>
		<category><![CDATA[flash drive]]></category>
		<category><![CDATA[thumb drive]]></category>
		<category><![CDATA[Vertica]]></category>
		<category><![CDATA[vstick]]></category>

		<guid isPermaLink="false">http://www.bcsolution.com/?p=385</guid>
		<description><![CDATA[Can you believe it? A bootable 16 Gb thumb-drive with Linux, GUI, Vertica AND room for data!?
Yes, I admit it. I am a geek. But how cool is this. Imagine developing, prototyping, and presenting a data warehouse without procuring a server, installing Linux, or commandeering your hard drive. Everything you need is on a small [...]]]></description>
			<content:encoded><![CDATA[<p><strong>Can you believe it? A bootable 16 Gb thumb-drive with Linux, GUI, Vertica </strong><strong>AND room for data!?</strong></p>
<p>Yes, I admit it. I am a geek. But how cool is this. Imagine developing, prototyping, and presenting a data warehouse without procuring a server, installing Linux, or commandeering your hard drive. Everything you need is on a small bootable USB thumb-drive.<span id="more-385"></span></p>
<h2 style="text-align: center;"><strong><span style="color: #333399;">Welcome VStick</span></strong></h2>
<p> </p>
<p>There were a few challenges in creating the VStick, but the complications were mostly around finding the right Linux distro.</p>
<li style="text-align: left; padding-left: 30px;">I had to find the right thumb-drive the could sustain decent read/write rates</li>
<li style="text-align: left; padding-left: 30px;">Finding a distro that could be installed on the thumb-drive was easy, getting it to work well was a challenge</li>
<li style="text-align: left; padding-left: 30px;">The distro had to be small enough to fit on a thumb-drive yet leave room for Vertica and some data.</li>
<li style="text-align: left; padding-left: 30px;">I had to make sure that it had all of the features, GUI, Text editor, browser, etc.. that would help with the evaluation and development process.</li>
<li style="text-align: left; padding-left: 30px;">I did a first pass in shutting down the unneeded services to free up some system resources</li>
<li style="text-align: left; padding-left: 30px;">I went through a few load and query cycles to test it out.</li>
<p> </p>
<p style="text-align: center;"><a href="http://www.bcsolution.com/wp-content/uploads/2009/06/VStick-screen-shot1.png"><img class="size-large wp-image-388 aligncenter" title="VStick-screen-shot" src="http://www.bcsolution.com/wp-content/uploads/2009/06/VStick-screen-shot1-1024x640.png" alt="Vertica on a Stick" width="614" height="384" /></a></p>
<p> </p>
<p>Initially, I loaded some sample data and just about all the queries performed with sub-second response times. I am now in the process of cramming as much data on the stick that I possibly can. I am excited to see what the performance and compression would be like on the larger data set.</p>
<p>This little midnight engineering project really highlights the technology of Vertica.</p>
<li style="text-align: left; padding-left: 30px;">The size of the entire Vertica installation is a miniscule 8 MB so it is very light on resources</li>
<li style="text-align: left; padding-left: 30px;">The columnar approach and compression deployed on the database minimizes I/O substantially. So far, the impact of the slower flash drive has gone unnoticed during my tests.</li>
<li style="text-align: left; padding-left: 30px;">If Vertica performs this well on a laptop with a thumb-drive, imagine what it can do on a rack of modern servers</li>
<p>Stay tuned, over the next few days, I will be loading and querying a larger data set and will let you know how it goes. Outside of evaluating and developing on Vertica, I was wondering if there was a market for a data warehouse on a stick? Perhaps the embedded market like Kiosks. </p>
<p>Let me know if you have any ideas?</p>
<p><em>Regards</em>,</p>
<p><em>Charlie</em></p>
<p> </p>
<p><em><br />
</em></p>
<address></address>
<address></address>
<address></address>
<address></address>
]]></content:encoded>
			<wfw:commentRss>http://www.bcsolution.com/2009/06/vertica-thumb-drive/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Can Netezza Search for Aliens?</title>
		<link>http://www.bcsolution.com/2009/06/netezza-search-aliens/</link>
		<comments>http://www.bcsolution.com/2009/06/netezza-search-aliens/#comments</comments>
		<pubDate>Thu, 04 Jun 2009 05:58:56 +0000</pubDate>
		<dc:creator>cwardell</dc:creator>
				<category><![CDATA[Innovation]]></category>
		<category><![CDATA[MPP]]></category>
		<category><![CDATA[distributed computing]]></category>
		<category><![CDATA[Grid Computing]]></category>
		<category><![CDATA[High Performance Computing]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[parallelism]]></category>
		<category><![CDATA[Super Computer]]></category>

		<guid isPermaLink="false">http://www.bcsolution.com/?p=347</guid>
		<description><![CDATA[ 
While I am on the innovation &#8220;rant&#8221; it reminds me of a few discussions I had with some of the brass at Netezza. I have been fortunate to have been invited to the Netezza campus a few times and we had a few blue-sky sessions. (This was before they got their new digs and their IPO.) My [...]]]></description>
			<content:encoded><![CDATA[<p> </p>
<p style="text-align: justify;"><a href="http://www.bcsolution.com/wp-content/uploads/2009/06/cute-alien.jpg"><img class="alignleft size-full wp-image-374" title="Can Netezza Search for Aliens?" src="http://www.bcsolution.com/wp-content/uploads/2009/06/cute-alien.jpg" alt="Can Netezza Search for Aliens?" width="104" height="130" /></a>While I am on the innovation &#8220;rant&#8221; it reminds me of a few discussions I had with some of the brass at Netezza. I have been fortunate to have been invited to the Netezza campus a few times and we had a few blue-sky sessions. (This was before they got their new digs and their IPO.) My first impressions were great. The office had an entrepreneurial spirit and to my surprise the R&amp;D team seemed to be <strong>well seasoned veterans</strong>. After talking with them for a while, you just knew they have been around the block once or twice.</p>
<p style="text-align: justify;">My name is Charles, but my friends call me Charlie. A running joke with those who know me is that I am a terrible wine snob. Or should I say I am terrible at wine snobbery.  I know all the moves and can put on a good show but couldn&#8217;t tell the difference between 2 buck chuck and Grey Poupon. <span id="more-347"></span></p>
<p style="text-align: justify;"> </p>
<p style="text-align: justify;"><img class="size-medium wp-image-375 alignleft" title="Netezza and my &quot;2-Buck-Chuck&quot;" src="http://www.bcsolution.com/wp-content/uploads/2009/06/2buckchuckcork1-299x300.png" alt="Netezza and my &quot;2-Buck-Chuck&quot;" width="209" height="210" /></p>
<p style="text-align: justify;">As a joke one evening, Netezza had brought a bottle of two-buck-chuck along to dinner and I just happened to  love it. Needless to say they left me with a wonderful collection for my wine basement as well as a new nick-name from my friends at Netezza. A great memory and a pretty funny one, but I wanted to share one of the thoughts that came out of that evening. </p>
<p style="text-align: justify;">Netezza is a massively parallel processing appliance. In the Netezza world, your data repository is spread out as thinly as possible across as many Spu/Blades that it can. Each Spu/Blade is responsible for a small part of the entire data repository.</p>
<p style="text-align: justify;">Let&#8217;s keep it simple for now and forget aggregation steps, summations, group by, etc. For my illustration, lets just say I want to perform a straight full table scan query against 1 billion records. If I have even distribution of <strong>1 billion</strong> records across 100 SPU/Blades. The MAX LOAD or the MAX # OF RECORDS that any <strong>one</strong> Spu/Blade would have to read and filter is <strong>10 million</strong> records.</p>
<p style="text-align: justify;">In this particular example, a 1 billion record query would responds like it was a 10 Million record query. And the MPP is linear. If I had 200 Spu/Blades the same 1 Billion Record query would respond like a 5 million record query. This is MPP in a nutshell. I know that I am cutting out a LOT of details here but those details are not important for this post.</p>
<p style="text-align: justify;"> </p>
<p style="text-align: justify;"><strong>The innovation</strong></p>
<p style="text-align: justify;"><strong><a href="http://www.bcsolution.com/wp-content/uploads/2009/06/netezza-spu.jpg"><img class="alignnone size-medium wp-image-348" title="Netezza SPU Blade" src="http://www.bcsolution.com/wp-content/uploads/2009/06/netezza-spu-300x96.jpg" alt="Netezza SPU Blade" width="240" height="77" /></a></strong></p>
<p style="text-align: justify;">Each Spu/Blade is essentially a dedicated computer that has its own dedicated:</p>
<ul style="text-align: justify;">
<li>• CPU</li>
<li>• RAM</li>
<li>• Cache</li>
<li>• I/O Channel</li>
<li>• Network Adapter Card</li>
<li>• Disk</li>
<li>• and FPGA.</li>
</ul>
<p style="text-align: justify;">Although Netezza is focused on crunching enormous amounts of data, there is a huge opportunity waiting to be capitalized on. Netezza has opened up the FPGA. (Field programable Gate Array) to developers. The FPGA is programmed by a subset of the C language and compiled and deployed as a user defined function on each and every SPU/Blade. When each record passes through the FPGA it can be zapped by a native compiled User Defined Function.</p>
<p style="text-align: justify;">You may be saying so what!? User Defined Functions have been around for as long as I can remember. What&#8217;s the big deal? How about Distributed Computing? What if the Netezza performance server was used as a massively parallel compute engine. The universities/academia, health research (IE:Cancer, aids, human genome), or the ever important search for extra terrestrial and large prime numbers could all be candidates for an MPP appliance in a box.</p>
<p style="text-align: justify;"> </p>
<p style="text-align: justify;">The next two paragraphs are from <a title="Wikipedia Entry for SETI@Home" href="http://en.wikipedia.org/wiki/Seti@home" target="_blank">WikiPedia</a></p>
<blockquote style="text-align: justify;">
<p style="text-align: justify; "><a title="Seti@Home" href="http://setiathome.ssl.berkeley.edu/" target="_blank">SETI@home</a> (&#8221;SETI at home&#8221;) is a distributed computing (grid computing) project using Internet-connected computers, hosted by the Space Sciences Laboratory, at the University of California, Berkeley, in the United States. SETI is an acronym for the Search for Extra-Terrestrial Intelligence. SETI@home was released to the public on May 17, 1999.[1][2][3]</p>
<p style="text-align: justify; ">Statistics</p>
<p style="text-align: justify; ">With over 5.2 million participants worldwide, the project is the distributed computing project with the most participants to date. The original intent of SETI@home was to <strong>utilize 50,000-100,000 home computers</strong>.[citation needed] Since its launch on May 17, 1999, the <strong>project has logged over two million years of aggregate computing time. </strong>On September 26, 2001, SETI@home had performed a total of 1021 floating point operations. It is acknowledged by the Guinness World Records as the largest computation in history.[12] With over 334,155 active computers in the system (1.8 million total) in 210 countries, as of August 04, 2008, SETI@home has the ability to compute over 528 TeraFLOPS.[13] For comparison, Blue Gene (one of the world&#8217;s fastest supercomputers) peaks at just over 596 TFLOPS with sustained rate of 478 TFLOPS.</p>
</blockquote>
<p style="text-align: justify;"> </p>
<p style="text-align: justify;">Certain industries, like Life Sciences, pharmaceutical, financial, can&#8217;t go distributing data across thousands of volunteer computers worldwide. Procuring large grids to deploy their computational task is a challenging and costly task. Setting up a 50,000 node grid in a data center is not something most people would want to take on. The HVAC requirements for that size grid is just enormous.</p>
<p style="text-align: justify;">The choice of a PowerPC CPU on the Netezza SPU/Blade was no accident. It runs significantly cooler than Intel or AMD. The reduced heat at the SPU/Blade level allows for a MUCH higher density per rack, smaller foot print on the data center floor, and reduces the HVAC and electrical requirement substantially. Come on Netezza! Where are the &#8220;Going Green&#8221; campaigns?</p>
<p style="text-align: justify;">Netezza already handles the hardware, distribution, connectivity, and compiler for the SPU&#8217;s. As a result,  this may become a very viable approach for computational heavy studies. The next time your screen saver downloads a job packet from seti@home, think of the potential opportunity that Netezza could provide in finding <a title="Seti@Home" href="http://setiathome.ssl.berkeley.edu/" target="_blank">aliens</a>, the <a title="The Human Genome Project" href="http://www.ornl.gov/sci/techresources/Human_Genome/home.shtml" target="_blank">Human Genome project</a>, <a title="FightAids@Home" href="http://fightaidsathome.scripps.edu/" target="_blank">FightAIDS@HOME</a></p>
<p style="text-align: justify;">I wonder if Netezza could remove the DBOS and Storage and go-to market with a streamlined HPC Grid. (High Performance Computing Environment). Let me know what you think and  what industry you see the potential in.</p>
<p style="text-align: justify;"> </p>
<p style="text-align: justify;">That&#8217;s my blue sky for tonight.</p>
<p style="text-align: justify;">Charlie (AKA: 2buckchuck)</p>
<p style="text-align: justify;"> </p>
]]></content:encoded>
			<wfw:commentRss>http://www.bcsolution.com/2009/06/netezza-search-aliens/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Hacking Vertica?</title>
		<link>http://www.bcsolution.com/2009/05/vertica-hack-1/</link>
		<comments>http://www.bcsolution.com/2009/05/vertica-hack-1/#comments</comments>
		<pubDate>Sun, 31 May 2009 19:26:02 +0000</pubDate>
		<dc:creator>cwardell</dc:creator>
				<category><![CDATA[Innovation]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[thumb drive]]></category>
		<category><![CDATA[Vertica]]></category>

		<guid isPermaLink="false">http://www.bcsolution.com/?p=322</guid>
		<description><![CDATA[Without fully disclosing what mad scienctist hack I am up to, the least I can do is tell you that It has something to do with Vertica and a thumb drive.]]></description>
			<content:encoded><![CDATA[<p style="text-align: justify;">I love innovation because of both the rewards and challenges that it brings. I have a passion for technology and love to experiment. I guess that I have grown up a little bit. Instead of taking my toaster apart, I simply watch the TV show &#8220;How it&#8217;s Made&#8221; .<span id="more-322"></span></p>
<p style="text-align: justify;">When it comes to software though, I feel out of place unless I have a wrench in my hand. The hands on experimentation allows me to be confident and non &#8220;theoretical&#8221; when I propose radical paradigm shifts to my customers. Without challenging the &#8220;traditional&#8221; and being somewhat of a Rebol, I don&#8217;t believe I would have successfully managed to get both Teradata and Netezza deployed in the pharmaceutical industry. This was a huge milestone for MPP and considered a pretty major accomplishment. <em>(Those of you who know pharma know first hand of the Oracle domination.)</em></p>
<p style="text-align: justify;"><em>By the way, </em><a href="http://www.rebol.com" target="_blank"><em>Rebol</em></a><em> above is not misspelled. It&#8217;s another &#8220;Secret Agent Man&#8221; technology I keep in my arsenal of weapons..</em></p>
<p style="text-align: justify;"><img class="size-full wp-image-326 alignright" title="Thumb Drive" src="http://www.bcsolution.com/wp-content/uploads/2009/05/patriotthumbdive.png" alt="Thumb Drive" width="250" height="188" /></p>
<p style="text-align: justify;">So what&#8217;s my next tinker?  I have many on my list. But I am closing in on something pretty radical. Without fully disclosing what mad scientist hack I am up to, the least I can do is tell you that It has something to do with <a href="http://www.vertica.com" target="_blank">Vertica</a> and a thumb drive. This will be helping in a future project of mine. (More details to come)</p>
<p style="text-align: justify;">Stay tuned  it may not be what you expect? The details should be posted within the next week or so.</p>
<p style="text-align: justify;">Best Regards,</p>
<p style="text-align: justify;">Charlie</p>
<p style="text-align: justify;"> </p>
<p style="text-align: justify;"> </p>
]]></content:encoded>
			<wfw:commentRss>http://www.bcsolution.com/2009/05/vertica-hack-1/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Netezza Paved the Way</title>
		<link>http://www.bcsolution.com/2009/05/netezza-paved-the-way/</link>
		<comments>http://www.bcsolution.com/2009/05/netezza-paved-the-way/#comments</comments>
		<pubDate>Sat, 30 May 2009 05:14:24 +0000</pubDate>
		<dc:creator>cwardell</dc:creator>
				<category><![CDATA[MPP]]></category>
		<category><![CDATA[BI]]></category>
		<category><![CDATA[Columnar Databases]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[Vertica]]></category>

		<guid isPermaLink="false">http://www.bcsolution.com/?p=310</guid>
		<description><![CDATA[Are we seeing a paradigm shift today as we saw when Netezza first launched their MPP appliance? Probably, but I think the road for Netezza is still pretty golden.]]></description>
			<content:encoded><![CDATA[<p style="text-align: justify;">Netezza did something revolutionary in the Data Warehouse space. They revolutionized, simplified, and packaged parallelism in a way that was just not done before. They simply got it right. Think about it. Netezza offered an extremely fast product with standards based SQL and connectivity at a price Teradata wouldn&#8217;t try to beat.<span id="more-310"></span></p>
<p style="text-align: justify;">Today you see a lot of traction with columnar databases. Rightfully so, when you understand the columnar approach and the minimization of I/O offered by technologies like Vertica it is a no-brainer to give it a try. Are we seeing a paradigm shift today as we saw when Netezza first launched their MPP appliance? Probably, but I think the road for Netezza is still pretty golden.</p>
<p style="text-align: justify;"><a href="http://www.bcsolution.com/wp-content/uploads/2009/05/netezza-frame.gif"><img class="alignleft size-full wp-image-315" title="Netezza Frame" src="http://www.bcsolution.com/wp-content/uploads/2009/05/netezza-frame.gif" alt="Netezza Frame" width="140" height="200" /></a>Netezza trumps the MPP space. Going public has validate not only the company but the technology. They have taken the survivability and risk of adoption question out of the picture. In doing so, Netezza has cleared the field and paved the way for technologies like Vertica and Paraccel.</p>
<p style="text-align: justify;">My guess is that over time, Netezza will win quite a few Teradata clients. As the capitalization of the Teradata investments come to an end, the need for increased storage capacity will grow. Faced with investing into more nodes, companies would be wise to re-evaluate their Teradata strategy and consider Netezza. (Given of course they are not locked into Teradata by purchasing the additional logical models or applications that run on it)</p>
<p style="text-align: justify;">Netezza will grab market share from Teradata and in doing so they just may keep NeoView in the gate for years to come. Oracle just doesn&#8217;t understand this space so at present, I don&#8217;t see any competitive threat from Larry for MPP architectures. I wouldn&#8217;t be surprised if Oracle made a bid for one of the Columnar databases though. (I would prefer however; that they stick with OLTP instead of misleading people into believing they can build large scale Query and Reporting databases on OLTP technologies.)</p>
<p style="text-align: justify;">If my hunch is correct, this opens up a tremendous opportunity for technologies like Vertica. I believe Netezza sees a great opportunity for C-Stores and surprisingly may welcome the competition to the space.</p>
<p style="text-align: justify;">I believe Netezza is focusing on the &#8220;prize&#8221;. I would assume that they want to be known as &#8220;The Brand&#8221; for Petabyte data warehouses. Technically there may be no size limit with Vertica, Paraccel, or Sybase. It is however my feeling that if an organization is crunching that much data, they will lean toward Teradata or Netezza.</p>
<p style="text-align: justify;">So I believe Vertica and Paraccel owes Netezza a few &#8220;props&#8221;. The success they are about to have is related to Netezza&#8217;s work in this space.</p>
<p style="text-align: justify;">In my next post, I will talk about a niche that Netezza may have outside of the data warehousing arena.</p>
<p style="text-align: justify;">Stay tuned..</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bcsolution.com/2009/05/netezza-paved-the-way/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Database in the Clouds?</title>
		<link>http://www.bcsolution.com/2009/05/database-clouds/</link>
		<comments>http://www.bcsolution.com/2009/05/database-clouds/#comments</comments>
		<pubDate>Wed, 27 May 2009 17:20:56 +0000</pubDate>
		<dc:creator>cwardell</dc:creator>
				<category><![CDATA[Tech Talk]]></category>
		<category><![CDATA[Cloud Datababase]]></category>
		<category><![CDATA[SimpleDB]]></category>

		<guid isPermaLink="false">http://www.bcsolution.com/?p=257</guid>
		<description><![CDATA[I am struggleing with this one. For VLDB's, I can't say I subscribe to the cloud databases like EC2, SimpleDB, etc.. I love the idea]]></description>
			<content:encoded><![CDATA[<p>I am struggling with this one. For VLDB&#8217;s, I can&#8217;t say I subscribe to the cloud databases like EC2, SimpleDB, etc.. I love the idea of throttling up or down the capacity on demand but I am not sure it is practical in the query and reporting sense.</p>
<p style="text-align: justify;">If I had a transactional system like Facebook, MySpace, Travelocity, Expedia, it would make much more sense for me. The data requirements are large but the data is accumulated over time, transaction by single transaction. Ad hoc queries, reporting, and analytics that happen on VLDB&#8217;s, usually, or at least, should not be performed against operational transaction based systems. There is usually an extract of the transactional data and it is housed in either a more suitable schema or better yet or more suitable VLDB technology.<span id="more-257"></span></p>
<p style="text-align: justify;">Here are a few questions and  challenges you may face:</p>
<ul>
<li>
<ul>
<li> • How do I get terabytes of data from my operational system  to the commodity clouds effective</li>
<li>• I am not sure I would want to say to my customers: &#8220;Just send your data to Amazon.com and I will take it from there.&#8221; Imagine how many hands it needs to touch before it gets to the floor of the data center.</li>
<li>• Are you in an industry (financial, pharmaceutical) where regulations are such that a multi-tenant grid would be frowned upon?</li>
</ul>
</li>
</ul>
<p style="text-align: justify;">I definitely see the positive side if the data was already at the cloud, but typically, we receive data via ftp in many different formats at different frequencies. They all need to be standardized and purified before it gets into the Cloud DB. So from my point of view, data warehousing is not a real good candidate for the Cloud.</p>
<p>I love the idea of the cloud and would welcome differing opinions. Am I missing something?</p>
<span class="sfforumlink"><a href="http://www.bcsolution.com/data-warehouse-community/vldb-dbms-technologies/database-in-the-clouds"><img src="http://www.bcsolution.com/wp-content/plugins/simple-forum/styles/icons/default/bloglink.png" alt="" /> Join the forum discussion on this post</a> - (1) Posts</span>]]></content:encoded>
			<wfw:commentRss>http://www.bcsolution.com/2009/05/database-clouds/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>&#8230;and this database is Just Right</title>
		<link>http://www.bcsolution.com/2009/05/teradata-netezza-vertica-choice/</link>
		<comments>http://www.bcsolution.com/2009/05/teradata-netezza-vertica-choice/#comments</comments>
		<pubDate>Tue, 26 May 2009 21:06:46 +0000</pubDate>
		<dc:creator>cwardell</dc:creator>
				<category><![CDATA[MPP]]></category>
		<category><![CDATA[Metapa]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[Vertica]]></category>

		<guid isPermaLink="false">http://www.bcsolution.com/?p=238</guid>
		<description><![CDATA[There is no theoretical size limit on databases based on MPP architectures. (Massively parallel processing). The more nodes, SPUS, blades, or pizza boxes you have in your cluster, the more capacity your database has and the faster it will perform.  With that said, I am pretty confident that Teradata, Netezza and Vertica can scale to the Petabyte size. There seems to be no real inhibiting factor that I know of]]></description>
			<content:encoded><![CDATA[<p style="text-align: justify;">I have been fortunate to have created data warehouses in Oracle, Teradata, Netezza, SQL Server and Vertica. (I did a POC with Greenplum when it was called Metapa but it was not quite ready for prime time.)</p>
<p style="text-align: justify;">I have never been able to achieve performance on multi-terabyte databases using SMP (Symmetrical Multi Processing) servers and OLTP databases. In my opinion, the relational/OLTP databases like Oracle, SQLServer, MYSql, etc.. are just the wrong tool for adhoc queries, analytics and reporting of VLDB&#8217;s (very large databases). The results I obtained when utilizing MPP have been staggering when compared to the response times of query and analytics of OLTP databases.</p>
<p style="text-align: justify;"><span id="more-238"></span>There is no theoretical size limit on databases based on MPP architectures. (Massively parallel processing). The more nodes, SPUS, blades, or pizza boxes you have in your cluster, the more capacity your database has and the faster it will perform.  With that said, I am pretty confident that Teradata, Netezza and Vertica can scale to the Petabyte size. There seems to be no real inhibiting factor that I know of.</p>
<p style="text-align: justify;">
<p style="text-align: justify;"><strong>So what does one choose?</strong></p>
<p style="text-align: justify;">I maintained a Teradata environment for about 6 years and have gone under the hood pretty deeply with Netezza and Vertica. I have programmed at the FPGA level for Netezza and had the opportunity to pick the brains of the senior engineers at Vertica for a while. I am pretty confident that at this point, the MPP technologies I mention below should handle the workload.</p>
<p style="text-align: center; ">The question comes down to &#8220;<em>What is your budget and business need</em>?&#8221;</p>
<p style="text-align: justify;"><strong>Teradata</strong></p>
<p style="text-align: justify;">Teradata has a wide range of logical models and applications they can sell along with the WH. Teradata supports transactional processing and ad-hoc queries very nicely. Teradata also has a professional services team which contributes a large portion of their revenue, but all this comes at a premium price.</p>
<p style="text-align: justify;"><strong>Netezza</strong></p>
<p style="text-align: justify;"><strong><span style="font-weight: normal; ">Netezza is a beast of a machine that does full table scans extremely well, their FPGA technology is very impressive, and the zone indexing does a nice job in minimizing I/O. Netezza&#8217;s secret sauce is in their FPGA (Field programable gate arrays). Not to long ago, Netezza opened up the FPGA to developers and now give tremendous power by creating FPGA based UDF&#8217;s (User defined functions). These UDF&#8217;s perform exceptionally well and can put the afterburners on your query and analytics if done correctly. Netezza has proven the test of time and has a pretty impressive client list and is a very viable alternative to Teradata.</span></strong></p>
<p style="text-align: justify;"><strong>Vertica</strong></p>
<p style="text-align: justify;">Vertica is relatively a new comer to the VLDB space and they are definitely the one to watch. I like the technology and the fact that Vertica is led by Stonebraker who is a veteran in the industry. ALso quite notable is Jerry Held (Exec from Oracle and Tandem) who sits on the board.  You will need to grasp a slight paradigm shift in the way their database operates but it supports ansi standard SQL and ODBC connections so you shouldn&#8217;t have any problems. Vertica&#8217;s compression algorithms are extremely clever and when combined with a columnar data store they are breaking records. Vertica will definitely give you an A-HA! moment once you understand how the magic happens.</p>
<p style="text-align: justify;">By the way, Syncsort and Vertica broke the world * TPC record for data loads at a fraction of a price when compared to Netezza and Teradata.</p>
<blockquote><p>* A clarifying point on the TPC record. This test was not an instance of the TPC-H benchmark but rather a benchmark on load performance based on the TPC-H data produced by DBGEN.  The benchmark was audited by a 3rd party, published, and available for others to contest.</p></blockquote>
<p style="text-align: justify;">
<p style="text-align: justify;"><strong>Summary</strong></p>
<p style="text-align: justify;">Technically they will all crunch the data and they will all do it MUCH faster than OLTP based databases. Some questions you will need to consider:</p>
<ul style="text-align: justify;">
<li>
<ul>
<li>Q) Do I need or am I looking at other logical data models and applications to kick start my project?</li>
<li>Q) Do I want to leverage a professional services team?</li>
<li>Q) Do I need user defined functions or stored procedures?</li>
<li>Q) Does purchasing proprietary hardware scare you?</li>
<li>Q) Do I have a substantial budget or is this a warehouse on a shoestring?</li>
<li>Q) Is my DBA adaptable?</li>
<li>Q) Do I have a mixed workload?</li>
<li>Q) How many users will be hitting the database concurrently?</li>
</ul>
</li>
</ul>
<p style="text-align: justify;">
<p style="text-align: center; ">I know both Vertica and Netezza offers proof of concepts.</p>
<p style="text-align: center; "><strong>TAKE ADVANTAGE OF THE POCS!</strong></p>
<p style="text-align: center; "><strong><br />
</strong></p>
<p style="text-align: justify;">One approach you may want to consider is to start with a technology like Vertica. They offer a free trial and it installs easily on commodity hardware. This will provide the lowest cost of ownership and it just may fit your needs.  In parallel, work on setting up the POC with Netezza. This may take a little longer because staging a Netezza environment within your walls has many logistics involved.  Choosing the right technology is a crucial step in the development of a Very Large Data Warehouse. <strong>You should take your time on this critical phase and choose wisely. </strong></p>
<span class="sfforumlink"><a href="http://www.bcsolution.com/data-warehouse-community/vldb-dbms-technologies/and-this-database-is-just-right"><img src="http://www.bcsolution.com/wp-content/plugins/simple-forum/styles/icons/default/bloglink.png" alt="" /> Join the forum discussion on this post</a> - (1) Posts</span>]]></content:encoded>
			<wfw:commentRss>http://www.bcsolution.com/2009/05/teradata-netezza-vertica-choice/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
	</channel>
</rss>
