Archive

Author Archive

DB2 for i › Expertise? Yes, I’ll have some

May 8th, 2013 Comments off


Last week I reached my own 25th anniversary working for IBM (imagine joining IBM on May 2nd 1988, being immediately sent to Rochester Minnesota, and having a mysterious future system go from rumor to real overnight!)

Working my entire IBM career in and around AS/400, iSeries and IBM i (and their users) has illuminated many wonderful things, the most profound being: truly brilliant and fiercely dedicated people.

In my last post I talked about the advantage of integration. What I did not say is, the expertise and wisdom of thousands of scientists, engineers and business folks are integrated into the IBM i platform. Maybe THAT is the real magic!


What is wisdom anyway?  Personally, I define and test wisdom using the following equation:


wisdom   =  knowledge  X  experience  X  time


To be considered truly wise, you must have the appropriate amount of knowledge for sure, but this is not enough. You must also have experience. In other words, you better have applied your knowledge in a practical and meaningful way.

Furthermore, it is a stretch to claim "wisdom" by only applying the knowledge once or twice. Ah, to be truly considered wise, you must have applied the knowledge many times, in many different circumstances, under many different conditions. As a corollary, I was once told that no one can be considered wise unless they have lived 50 years; I better leave that notion for a different conversation...

For sure the IBM i platform embodies the wisdom of IBMers, business partners and users from around the world. But the system is also surrounded by a team of people with an unrivaled breadth and depth in all things technical.



IBM STG Lab Services


Within IBM Systems and Technology Group exists a set of technical consultants whose sole mission is to ensure the success of IBM i users. That team is known as Lab Services.

As a member of the Lab Services team, I can state that we are here to guide you, educate you and provide services to you. We are here to connect the dots between Power Systems and IBM i, and between IBM i and DB2. Connect the dots between systems and storage, and determine what needs to be real and what can be virtual. We can separate, or combine blue sky and clouds.

When it comes to helping clients get the most out of their IBM i system and related business solutions, IBM STG Lab Services has the knowledge, the experience and the time.


If you would like to tap into this wisdom, please reach out to our business manager, Mark Even (even AT us.ibm.com).

And if you have a comment or success story to share, please join the conversation on facebook, where the IBM i 25th anniversary celebration continues.



Read the original at DB2 for i.

Categories: Blogs Tags:

DB2 for i › SEE

April 24th, 2013 Comments off


Simple

Efficient

Effective


These are the attributes sought out in almost everything we acquire and use to solve problems.

These are the attributes of an "appliance".

These are the attributes of the system known as "IBM i".

But what is the magic, the special sauce, the pixie dustthat makes "IBM i" simple, efficient and effective?

Before I reveal the answer, let me tell a story that represents a situation I find myself in all too often. Maybe the scenario will help you too!
____



An organization is in the process of evaluating a new and/or different business application to meet their needs. Whether the application is purchased or built is irrelevant to the conversation.

Assume that the application can be stood up on any of the common hardware and operating system stacks available today. In other words, the same application serves the business whether installed on Intel/Windows, Intel/UNIX, Intel/Linux, Power/AIX, Power/Linux, or Power/IBM i. You get the idea.

Once the application is chosen, the hardware and operating system arguments discussions begin. The various factions and alliances form up and make themselves known. The assumptions and so called "best practices" begin to manifest around every corner.

"UNIX is open"    "Linux is cheaper"

"Intel powers the most cost effective servers"

"Everyone knows that the application will work best on HP/Oracle"

"HA! IBM Power systems running AIX are more robust and scalable"


Now the database heavies show up. Their focus is only on data management and serving (and the requisite administration of such things).

"I don't care what hardware you run on, Oracle runs the app most effectively"

"If we are running the app on Windows, we absolutely have to use SQL Server"

"Obviously DB2 is the preferred database management system for this application"


Usually at this point, the organization's business leaders are thoroughly confused (and the respective hardware and software vendors haven't shown up yet)!

Off in the corner is a lone IT person; someone who has been around for 20+ years systematically providing value. This person has been solving business problems with the AS/400, then iSeries, then IBM i all along the organization’s 25 year growth curve.

This person knows what simple, efficient and effective really mean. This person realizes a terrible mistake is about to be made.

The lone IT person reaches out for help...
____


In writing this, I now realize it's been more than 6 years since I first constructed the thought provoking and repeatable response to the situation I describe above.  You see, if the application can run on any hardware, why run it on Power?   If the application can run in any operating system, why run it in IBM i?  If the application can use any relational database management system, why use DB2?

To answer these questions separately invites attack from all sides, including from so called colleagues and partners. To answer these questions holistically - well that's a game changer.
____


Through the power of networking (and some good luck), the lone IT person finds me. After an urgent phone conversation describing how the new platform search is about to throw out their beloved AS/400 for something “new and modern”, a strategy is put in place; a strategy that will neutralize the piecemeal competitors by revealing advanced technology, while illuminating the elegance of meeting all the requirements with a unified solution.
____


Given a choice, my strategy will always include getting in front of the organization’s leaders who are charged with making both the technical decision and the financial decision. It is advantageous to have the naysayers in the room too. After all, we should keep our friends close and our enemies closer.

While I have the rapt attention of the decision makers, each layer of the stack infrastructure is looked at through the lens of "IBM i". But keep in mind, employing this technique alone is doomed to failure. In other words, when talking about hardware, the propeller heads will crush you with speeds and feeds, and the accountants will pound you with acquisition costs. When talking about the operating system, the UNIX, Linux, Windows chorus will shout you down and press you into silence. And when talking about database, the Oracle, SQL Server and DB2 administrators will pile on and suffocate you with techno-speak involving strange acronyms, code words and something about creating spaces.

Even though you’ll be told there is a “better”, “cheaper”, “modern” alternative for each, never fear, due diligence must be done on each of these levels. The requirements for hardware, operating system and database must not be overlooked, nor under estimated. The unique attributes of the “IBM i system” at each level are fundamental to the solution and the audience must be exposed to them accordingly.
____ 


My presentation to the somewhat skeptical and possibly hostile audience proceeds in this fashion... 

All business solutions (read applications) need a solid, yet flexible foundation. IBM definitely can provide that.  Check.



Proceeding from the bottom up…

Illuminate and illustrate the features, functions and benefits of Power Systems (i.e. the hardware). Check.


Illuminate and illustrate the features, functions and benefits of IBM i (i.e. the OS). Check.



Illuminate and illustrate the features, functions and benefits of DB2 for i (i.e. the RDBMS). Check.




The advanced technology represented within the heretofore under appreciated system is now getting some attention. The audience begins to see their “legacy” system in a new and brilliant light. I catch a low, almost inaudible comment from a corner of the room, “wow, we have this? I didn’t know all this was in there, cool.”

But not so fast, the sundry factions represented in the audience have a counter and retort for each respective level of the stack. They sense a "me too" argument that can turn in their favor.

Meanwhile, the real audience - the decision makers that need to see the light, are still blind, searching for my hand and the promise of salvation.

Anticipating the mood and position of the audience, I pose the setup question:

If there is another, perfectly acceptable alternative for the hardware, the operating system and the database management system respectively, what’s the advantage of going with IBM i?
 


And I slowly reveal the answer…


____


Every IT vendor, including IBM, continues to seek a level of deep and profound integration that results in efficiency and cost effectiveness. They search for integration that will hide complexity and time consuming administration. In most cases, they search in vain.

Integration is not an afterthought, it happens from the beginning, and is sustained throughout.

Integration is not a marketing magic trick, it’s real, and it’s sustainable.

Integration does not happen by accident, it happens on purpose through vision and foresight.




In 1988, the platform introduced to the world as the Application System/400 was viewed as the paragon of integration – far surpassing anything in existence. The innovative and unique idea of hardware, operating system, database and solution seamlessly woven together as the IBM i system continues to be exploited twenty-five years later by tens of thousands of businesses around the world. 

The notion of a simple, efficient and effective system never goes out of style.
____ 


Why IBM i?

You SEE, it's all about the INTEGRATION. 


For more of the IBM i discussion and celebration look here.

If you are still a bit confused about DB2 for i, go back and read this.  And of course, feel free to reach out...


Read the original at DB2 for i.

Categories: Blogs Tags:

DB2 for i › SEE

April 24th, 2013 Comments off


Simple

Efficient

Effective


These are the attributes sought out in almost everything we acquire and use to solve problems.

These are the attributes of an "appliance".

These are the attributes of the system known as "IBM i".

But what is the magic, the special sauce, the pixie dustthat makes "IBM i" simple, efficient and effective?

Before I reveal the answer, let me tell a story that represents a situation I find myself in all too often. Maybe the scenario will help you too!
____



An organization is in the process of evaluating a new and/or different business application to meet their needs. Whether the application is purchased or built is irrelevant to the conversation.

Assume that the application can be stood up on any of the common hardware and operating system stacks available today. In other words, the same application serves the business whether installed on Intel/Windows, Intel/UNIX, Intel/Linux, Power/AIX, Power/Linux, or Power/IBM i. You get the idea.

Once the application is chosen, the hardware and operating system arguments discussions begin. The various factions and alliances form up and make themselves known. The assumptions and so called "best practices" begin to manifest around every corner.

"UNIX is open"    "Linux is cheaper"

"Intel powers the most cost effective servers"

"Everyone knows that the application will work best on HP/Oracle"

"HA! IBM Power systems running AIX are more robust and scalable"


Now the database heavies show up. Their focus is only on data management and serving (and the requisite administration of such things).

"I don't care what hardware you run on, Oracle runs the app most effectively"

"If we are running the app on Windows, we absolutely have to use SQL Server"

"Obviously DB2 is the preferred database management system for this application"


Usually at this point, the organization's business leaders are thoroughly confused (and the respective hardware and software vendors haven't shown up yet)!

Off in the corner is a lone IT person; someone who has been around for 20+ years systematically providing value. This person has been solving business problems with the AS/400, then iSeries, then IBM i all along the organization’s 25 year growth curve.

This person knows what simple, efficient and effective really mean. This person realizes a terrible mistake is about to be made.

The lone IT person reaches out for help...
____


In writing this, I now realize it's been more than 6 years since I first constructed the thought provoking and repeatable response to the situation I describe above.  You see, if the application can run on any hardware, why run it on Power?   If the application can run in any operating system, why run it in IBM i?  If the application can use any relational database management system, why use DB2?

To answer these questions separately invites attack from all sides, including from so called colleagues and partners. To answer these questions holistically - well that's a game changer.
____


Through the power of networking (and some good luck), the lone IT person finds me. After an urgent phone conversation describing how the new platform search is about to throw out their beloved AS/400 for something “new and modern”, a strategy is put in place; a strategy that will neutralize the piecemeal competitors by revealing advanced technology, while illuminating the elegance of meeting all the requirements with a unified solution.
____


Given a choice, my strategy will always include getting in front of the organization’s leaders who are charged with making both the technical decision and the financial decision. It is advantageous to have the naysayers in the room too. After all, we should keep our friends close and our enemies closer.

While I have the rapt attention of the decision makers, each layer of the stack infrastructure is looked at through the lens of "IBM i". But keep in mind, employing this technique alone is doomed to failure. In other words, when talking about hardware, the propeller heads will crush you with speeds and feeds, and the accountants will pound you with acquisition costs. When talking about the operating system, the UNIX, Linux, Windows chorus will shout you down and press you into silence. And when talking about database, the Oracle, SQL Server and DB2 administrators will pile on and suffocate you with techno-speak involving strange acronyms, code words and something about creating spaces.

Even though you’ll be told there is a “better”, “cheaper”, “modern” alternative for each, never fear, due diligence must be done on each of these levels. The requirements for hardware, operating system and database must not be overlooked, nor under estimated. The unique attributes of the “IBM i system” at each level are fundamental to the solution and the audience must be exposed to them accordingly.
____ 


My presentation to the somewhat skeptical and possibly hostile audience proceeds in this fashion... 

All business solutions (read applications) need a solid, yet flexible foundation. IBM definitely can provide that.  Check.



Proceeding from the bottom up…

Illuminate and illustrate the features, functions and benefits of Power Systems (i.e. the hardware). Check.


Illuminate and illustrate the features, functions and benefits of IBM i (i.e. the OS). Check.



Illuminate and illustrate the features, functions and benefits of DB2 for i (i.e. the RDBMS). Check.




The advanced technology represented within the heretofore under appreciated system is now getting some attention. The audience begins to see their “legacy” system in a new and brilliant light. I catch a low, almost inaudible comment from a corner of the room, “wow, we have this? I didn’t know all this was in there, cool.”

But not so fast, the sundry factions represented in the audience have a counter and retort for each respective level of the stack. They sense a "me too" argument that can turn in their favor.

Meanwhile, the real audience - the decision makers that need to see the light, are still blind, searching for my hand and the promise of salvation.

Anticipating the mood and position of the audience, I pose the setup question:

If there is another, perfectly acceptable alternative for the hardware, the operating system and the database management system respectively, what’s the advantage of going with IBM i?
 


And I slowly reveal the answer…


____


Every IT vendor, including IBM, continues to seek a level of deep and profound integration that results in efficiency and cost effectiveness. They search for integration that will hide complexity and time consuming administration. In most cases, they search in vain.

Integration is not an afterthought, it happens from the beginning, and is sustained throughout.

Integration is not a marketing magic trick, it’s real, and it’s sustainable.

Integration does not happen by accident, it happens on purpose through vision and foresight.




In 1988, the platform introduced to the world as the Application System/400 was viewed as the paragon of integration – far surpassing anything in existence. The innovative and unique idea of hardware, operating system, database and solution seamlessly woven together as the IBM i system continues to be exploited twenty-five years later by tens of thousands of businesses around the world. 

The notion of a simple, efficient and effective system never goes out of style.
____ 


Why IBM i?

You SEE, it's all about the INTEGRATION. 


For more of the IBM i discussion and celebration look here.

If you are still a bit confused about DB2 for i, go back and read this.  And of course, feel free to reach out...


Read the original at DB2 for i.

Categories: Blogs Tags:

DB2 for i › Sustained Business Value – A Rare Commodity

April 9th, 2013 Comments off
788 400 000 seconds

13 140 000 minutes

219 000 hours

9 125 days

25 years

two and a half decades

a quarter century

  • A computing platform that changed the game of information technology
  • A business tool that allowed organizations large and small to provide products and services much more effectively
  • A business solution that set an extremely high bar in terms of innovation and value
  • A system that continues to be the very model of integration and simplicity


The legendary and world renowned IBM i  reaches a significant milestone this year!


This week the reflection, recognition and rejoicing kicked off at the COMMON conference.

I highly recommend you follow the conversation here.

Over the course of the next few weeks, look for deeper discussions here on my blog, as well these:

You and i

i Can

iDevelop


If you have been taking advantage of AS/400, iSeries, and/or IBM i for a couple decades, sit back and bask in the knowledge that you made a very wise choice.  If you haven't, then pay attention to what you've been missing, and more importantly, join the club!



Read the original at DB2 for i.

Categories: Blogs Tags:

DB2 for i › Sustained Business Value – A Rare Commodity

April 9th, 2013 Comments off
788 400 000 seconds

13 140 000 minutes

219 000 hours

9 125 days

25 years

two and a half decades

a quarter century

  • A computing platform that changed the game of information technology
  • A business tool that allowed organizations large and small to provide products and services much more effectively
  • A business solution that set an extremely high bar in terms of innovation and value
  • A system that continues to be the very model of integration and simplicity


The legendary and world renowned IBM i  reaches a significant milestone this year!


This week the reflection, recognition and rejoicing kicked off at the COMMON conference.

I highly recommend you follow the conversation here.

Over the course of the next few weeks, look for deeper discussions here on my blog, as well these:

You and i

i Can

iDevelop


If you have been taking advantage of AS/400, iSeries, and/or IBM i for a couple decades, sit back and bask in the knowledge that you made a very wise choice.  If you haven't, then pay attention to what you've been missing, and more importantly, join the club!



Read the original at DB2 for i.

Categories: Blogs Tags:

DB2 for i › More Advice! Is this Good OR Bad?

March 29th, 2013 Comments off

With the arrival of IBM i TR6 we find the DB2 for i Query Optimizer enhanced to provide more index advice. In a previous post I discussed the philosophy behind generating index advice during the SQL query optimization process.  I also called out the limitations in the advice provided. One limitation of note involves no advice given for different columns ORed together, such as:

SELECT    *
FROM       NFL_TEAMS
WHERE    COLOR = 'PURPLE' OR EATS_PEOPLE = 'YES';

If you've attended our DB2 for i SQL Performance class, you know that multiple indexes over the same table can be used to assist the database engine with identifying rows of interest. This technique is known as "index ORing". But without advice from optimization, you are on your own to determine the appropriate indexes to create.

Keep in mind that when ORing the same column together, the optimizer has always been able to produce advice for queries such as:

SELECT    *
FROM        NFL_TEAMS
WHERE     COLOR = 'BLACK' OR COLOR = 'BLUE'


With the latest database group PTFs installed, the query optimizer will now advise indexes for up to five ORed predicates involving different columns. Additional details can be found here.

If your query has more than five columns ORed together, there will be no index advice produced (had to draw the line somewhere). Besides, ORing five different columns together is likely to result in most of rows in the table being selected anyway - in which case a full table scan is the best option, who knows...

There are additional limitations when mixing ANDed and ORed predicates in the query as well.


Some Examples

 

The following query now produces index advice (two separate indexes). Indexes that can be used to understand the data, as well as used for implementation:

select   *
from    orders
where  orderpriority = '5-LOW'
OR      shipmode = 'AIR';




Notice that the next query produces index advice consisting of one index (this is business as usual):

select    *
from     orders
where   orderpriority = '5-LOW'
AND    shipmode = 'AIR';






When more than five different columns are ORed together, no advice is produced:

select    count(*)
from     orders
where   orderpriority = '5-LOW'
OR       shipmode = 'AIR'
OR       quantity = 1
OR       year = 2013
OR       shippriority = 0
OR       orderkey = 1
OR       custkey = 10;






As mentioned previously, when mixing ANDed and ORed predicates in the query there are still limitations in the advice that can be generated. This is due in part to the number of key combinations that could be produced. In the following query, the optimizer only gives advice for the same column(s) appearing on both sides of the OR condition (i.e. YEAR). It is not going to advise two separate indexes.

select    *
from    orders
where  (year = 2013 and shipmode = 'AIR')
OR      (year = 2013 and orderpriority = '5-LOW');





I'll let you figure out why the optimizer only advises one index in the next case, and why it contains columns (YEAR, ORDERPRIORITY, SHIPMODE):

SELECT *
FROM   orders
WHERE  (year = 2013
        AND month = 1
        AND ((orderpriority = '5-LOW'
              AND shipmode = 'AIR')
             OR (orderpriority = '1-URGENT'
                 AND shipmode = 'TRUCK')))
       OR (year = 2012
           AND quarter = 4
           AND ((orderpriority = '5-LOW'
                 AND shipmode = 'AIR')
                OR (orderpriority = '1-URGENT'
                    AND shipmode = 'TRUCK')));






Join In


The new index advice for ORs also works with JOIN conditions (and of course, when mixing ANDs with ORs, the same limitations apply):

SELECT    o.*
FROM       orders AS o
INNER JOIN
                  dates AS d
ON           (o.shipdate = d.datekey OR o.receiptdate = d.datekey)
WHERE    d.year = 2013
AND         d.month = 1;





Words of Wisdom


Please reread my post: Index Advice - Take it or Leave it.  Read it again, and take it to heart. Analyzing SQL queries and tuning them via indexing techniques is both science and art. For example, should you create a radix index, or should you create an encoded vector index? I don't know, it depends. Obviously, seeking out the dependencies is important.

Knowledge, understanding and experience are required to really make sense of optimization and SQL query behavior. Getting educated and trained is the best and most important first step.

And rest assured that DB2 for i is working hard to optimize your queries to allow the best possible performance with your data, in your environment. This includes taking its own advice and creating the indexes for you...


Read the original at DB2 for i.

Categories: Blogs Tags:

DB2 for i › More Advice! Is this Good OR Bad?

March 29th, 2013 Comments off

With the arrival of IBM i TR6 we find the DB2 for i Query Optimizer enhanced to provide more index advice. In a previous post I discussed the philosophy behind generating index advice during the SQL query optimization process.  I also called out the limitations in the advice provided. One limitation of note involves no advice given for different columns ORed together, such as:

SELECT    *
FROM       NFL_TEAMS
WHERE    COLOR = 'PURPLE' OR EATS_PEOPLE = 'YES';

If you've attended our DB2 for i SQL Performance class, you know that multiple indexes over the same table can be used to assist the database engine with identifying rows of interest. This technique is known as "index ORing". But without advice from optimization, you are on your own to determine the appropriate indexes to create.

Keep in mind that when ORing the same column together, the optimizer has always been able to produce advice for queries such as:

SELECT    *
FROM        NFL_TEAMS
WHERE     COLOR = 'BLACK' OR COLOR = 'BLUE'


With the latest database group PTFs installed, the query optimizer will now advise indexes for up to five ORed predicates involving different columns. Additional details can be found here.

If your query has more than five columns ORed together, there will be no index advice produced (had to draw the line somewhere). Besides, ORing five different columns together is likely to result in most of rows in the table being selected anyway - in which case a full table scan is the best option, who knows...

There are additional limitations when mixing ANDed and ORed predicates in the query as well.


Some Examples

 

The following query now produces index advice (two separate indexes). Indexes that can be used to understand the data, as well as used for implementation:

select   *
from    orders
where  orderpriority = '5-LOW'
OR      shipmode = 'AIR';




Notice that the next query produces index advice consisting of one index (this is business as usual):

select    *
from     orders
where   orderpriority = '5-LOW'
AND    shipmode = 'AIR';






When more than five different columns are ORed together, no advice is produced:

select    count(*)
from     orders
where   orderpriority = '5-LOW'
OR       shipmode = 'AIR'
OR       quantity = 1
OR       year = 2013
OR       shippriority = 0
OR       orderkey = 1
OR       custkey = 10;






As mentioned previously, when mixing ANDed and ORed predicates in the query there are still limitations in the advice that can be generated. This is due in part to the number of key combinations that could be produced. In the following query, the optimizer only gives advice for the same column(s) appearing on both sides of the OR condition (i.e. YEAR). It is not going to advise two separate indexes.

select    *
from    orders
where  (year = 2013 and shipmode = 'AIR')
OR      (year = 2013 and orderpriority = '5-LOW');





I'll let you figure out why the optimizer only advises one index in the next case, and why it contains columns (YEAR, ORDERPRIORITY, SHIPMODE):

SELECT *
FROM   orders
WHERE  (year = 2013
        AND month = 1
        AND ((orderpriority = '5-LOW'
              AND shipmode = 'AIR')
             OR (orderpriority = '1-URGENT'
                 AND shipmode = 'TRUCK')))
       OR (year = 2012
           AND quarter = 4
           AND ((orderpriority = '5-LOW'
                 AND shipmode = 'AIR')
                OR (orderpriority = '1-URGENT'
                    AND shipmode = 'TRUCK')));






Join In


The new index advice for ORs also works with JOIN conditions (and of course, when mixing ANDs with ORs, the same limitations apply):

SELECT    o.*
FROM       orders AS o
INNER JOIN
                  dates AS d
ON           (o.shipdate = d.datekey OR o.receiptdate = d.datekey)
WHERE    d.year = 2013
AND         d.month = 1;





Words of Wisdom


Please reread my post: Index Advice - Take it or Leave it.  Read it again, and take it to heart. Analyzing SQL queries and tuning them via indexing techniques is both science and art. For example, should you create a radix index, or should you create an encoded vector index? I don't know, it depends. Obviously, seeking out the dependencies is important.

Knowledge, understanding and experience are required to really make sense of optimization and SQL query behavior. Getting educated and trained is the best and most important first step.

And rest assured that DB2 for i is working hard to optimize your queries to allow the best possible performance with your data, in your environment. This includes taking its own advice and creating the indexes for you...


Read the original at DB2 for i.

Categories: Blogs Tags:

DB2 for i › As I See it: Victor Rozek is Very Good

March 25th, 2013 Comments off
Or he has a very special set of glasses.


As some of you know, IT Jungle publishes an electronic newsletter called The Four Hundred. I read one author's articles religiously. That author is Victor Rozek.

I believe (from reading) that Victor has a fantastic ability to connect the dots, and illuminate the connections in an insightful way.

Instead of just shining a bright white light on his subject of choice, he also points beams consisting of a single color, just to see what might be revealed. More often than not, he gives the reader a glimpse into a totally unexpected or otherwise unknown aspect. Thus it will come as no surprise that his column is entitled: "As I See It:".  HA!  more like "do YOU see it!".

His most recent article is no exception. Because the article is about data proliferation and the related pursuits of piling it up and sifting through it, I'm calling it out.

Whether you do data or not, you should take a moment to read Victor's article here.




Read the original at DB2 for i.

Categories: Blogs Tags:

DB2 for i › As I See it: Victor Rozek is Very Good

March 25th, 2013 Comments off
Or he has a very special set of glasses.


As some of you know, IT Jungle publishes an electronic newsletter called The Four Hundred. I read one author's articles religiously. That author is Victor Rozek.

I believe (from reading) that Victor has a fantastic ability to connect the dots, and illuminate the connections in an insightful way.

Instead of just shining a bright white light on his subject of choice, he also points beams consisting of a single color, just to see what might be revealed. More often than not, he gives the reader a glimpse into a totally unexpected or otherwise unknown aspect. Thus it will come as no surprise that his column is entitled: "As I See It:".  HA!  more like "do YOU see it!".

His most recent article is no exception. Because the article is about data proliferation and the related pursuits of piling it up and sifting through it, I'm calling it out.

Whether you do data or not, you should take a moment to read Victor's article here.




Read the original at DB2 for i.

Categories: Blogs Tags:

DB2 for i › Living Large

March 12th, 2013 Comments off

There is a difference - a BIG difference - between “doing database” and doing “very large database”.

Tom McKinley, my longtime partner in this business of database regularly states, “bugs don’t scale” - meaning: what works just fine at a small size doesn’t necessarily work at a large size, and more than likely will NEVER work at a very large size.


It’s vitally important to first recognize and acknowledge that you are living in the realm of very large database (which is commonly abbreviated as “VLDB” by the way). Once recognized and acknowledged, you can begin to understand. That is, understand the critical success factors and best practices for maintaining and exploiting an IBM i VLDB environment.


Some History


I’ve been working in and around very large database since 1997, having helped to establish and lead the AS/400 Teraplex Center in Rochester Minnesota.  During the five year reign of our terrorTeraplex, we had the privilege to privately own and use the largest AS/400 systems built to date. Embarrassingly huge things. We not only had one, but we had several maxed out configurations. What can you do with a very large system at your disposal? Well, our mission was to research, quantify and explain very large database capabilities in the OS/400 environment. The driving force at the time was primarily data warehousing and business intelligence, as well as the emerging technology of parallelism. We were the first to demonstrate the positive benefits of exploiting 20GB of memory, a new system maximum at the time. I know, I know, 20GB doesn’t sound like much today but remember, WinTel database servers were stuck at 4GB and hitting that limit required buying another server for the farm (pause for effect and reflection). Of course today, an IBM Power system can have 16TB of memory. Wow, have times changed.

Back in the day, we were the first to build and use a 2.15 billion row table, then a 4.3 billion row table. And the first to build and use a 512GB table, then a 1TB table and then a 1.7TB table. We were the first to build large indexes using DB2 Symmetrical Multiprocessing feature to take advantage of all 8, then 12, then 24 processors. Ahhh, the days when a single job could light up all the front panel lights while pushing 24 CPUs to the max in one big whoosh of power! I really miss the lights…

Lo and behold, through the infinite power of the internet, I managed to dig up an article from 1998 that announced the results from my team’s creation and use of the largest single DB2/400 table to date – an unimaginable 2.5 terabytes comprised of 10 billion rows in a single table. For a blast from the past, you can find and read the article here

As you can see, the database capabilities of IBM i running on past and present Power systems are historic and vast – and unfortunately, little known (but you’re now coming up to speed, and spreading the word, right?).


What is VLDB, and Do I Have It?


The definition of very large database varies greatly, with new high water marks and attributes being established almost daily. This is especially true given the explosive expansion of data from all sorts of sources and events (see Big Data). But what about in the IBM i environment, what do we consider to be very large? Or better yet, when should you start to think and behave differently due to the size of a table or index?
 

Generally speaking, we start to seriously consider VLDB techniques and practices when a single table (or physical file) approaches 500GB in total size and/or the number of rows approaches or exceeds 1 billion (that’s 1 000 000 000 for my UK friends).

Another major consideration is the rate of data growth and the trend line illustrating when the table or physical file size limit will be reached. An event to watch out for is acquisition; when your company decides to branch out and/or acquire a large set of clients, customers or market share. If the current state of the database is unknown or unaccounted for, the acquisition and corresponding mass addition of data can push you into VLDB territory, or throw you up against a limit in one fell swoop.

Limit! What Limit?


Yes, Virginia, there is a limit. Even though DB2 for i enjoys some of the highest limits of any relational database system available, there are indeed maximums for a single data space (i.e. the container that physically holds the data). The two limits we refer to when talking about very large database are: size and number of rows.

Drum roll please... AND THE LIMITS ARE:

     1.7 terabytes or 4.3 billion (4 294 967 288) rows for a table / physical file

     1 terabyte for an index / keyed logical file

By the way, all of the DB2 for i limits can be found in the SQL Reference - Appendix A. 

Hey… wait a minute. If the limit is 1.7TB and/or 4.3 billion rows, how did you create a table that was 2.5TB in size containing 10 billion rows?

Simple, my team not only knows the limits, but more importantly, knows how and when to overcome them.

And so should you!


What Not To Do


In the past 2 months, the DB2 for i Center of Excellence has been involved in no less than four critical situations where IBM i users unknowingly hit the table limit. Sorry, let me rephrase that. They knew they hit something, ‘cause their application stopped working. But they didn’t know what they hit.  And frankly, THAT is the issue. These folks had no idea that a) there is a limit, b) they were really close to it, and c) when the limit is reached, the business stops. Have a nice day. 


What To Do


If you think your tables are getting big, or you are just curious, ask the system to show the metadata (i.e. data about data).  Try this:

Get the latest version of IBM i Navigator (aka IBM i Access - System i Navigator) – this would be the IBM i 7.1 Windows client. Install it, along with the latest service pack. Connect to your DB2 for i system and invoke Health Center.



The DB2 for i Health Center can show you a lot of things, but we are interested in the Size Limits for objects in a particular schema.



If you want to change the schema of interest, hit the button and do so:



When selecting the Size Limits tab, we get the opportunity to view the table and index current size compared to the maximum size or limit:




Take a look at both the table size and number of rows. If either of these are approaching the limit, you'll see a warning indicating a critical state. Please do be aware that if the schema has a lot of tables in it, this operation might take a few minutes.




If you are interested in where the index size is at, select this object type:



If you want a programmatic approach to reviewing your table and index sizes as compared to the limits, there is a stored procedure called QSYS2.Health_Size_Limits () and the description can be found here.


Do More


Strategically speaking, here is my list of IBM i Very Large Database critical success factors (aka “do or die”): 

2: get educated - obtain skills and expertise in all things DB2 for i
3: assess the current state of data, and identify the data growth and usage trends
4: build and implement a plan to handle large tables and indexes if approaching limits
5: build and implement a plan to modernize and re-engineer database objects and data access 

_____


The IBM DB2 for i Center of Excellence is the foremost (and dare I say only) team in the world fully qualified to assist you with very large database in the IBM i environment. As a matter of fact, the table limits and VLDB capabilities are what they are today because of the past research and leadership of the AS/400 Teraplex Center and the expertise of the men and women in the DB2 for i Development Laboratory.

If you need help with understanding the current state of your growing databases, or with meeting requirements for large data and scalability, contact me. We have a DB2 for i Very Large Database Consulting Workshop available that can identify solutions for overcoming limits to growth and ensure you are getting the most out of IBM i. We can also do remote database and application assessments to help identify any looming issues sooner than later – that is to say, before your business hits a wall. 

At the end of the day, rest assured that DB2 for i can handle your data whether you count rows in the millions or in the billions, assuming you embark on proper planning and implementation of the very large database practices and procedures for IBM i.





Read the original at DB2 for i.

Categories: Blogs Tags: