Dynamic Load Balancer Plustm


Previous Contents Index

Chapter 5
Starting DLB Plus I/O Analysis

The DLB Plus I/O analysis and reporting system is very easy to use. A DCL command is entered which executes the DLBPLUS.COM file. This command file initializes the DLB Plus I/O analysis and reporting system and presents the user with menus from which processing and report selections can be made.

It is recommended that a separate directory be created into which DLB Plus can store collected data. When you run the DLB Plus analysis and reporting system, the directory you are in --- is the directory that will contain the diagnostic data collected by DLB Plus. The collected data is used for the I/O analysis reports and graphs.

When DLB Plus is run and I/O data is collected, the collected (sampled) data represents a snapshot of the images (programs) currently executing on the node from which you are running DLB Plus.

Note

If there are two or more nodes on your cluster, you must log onto each node to collect the data for that node.

If needed, DLB Plus temporarily acquires the following privilege for analyzing I/O information: TMPMBX.

DLB Plus requires a BYTLM quota of at least 50000 and a PGFLQUO of at least 35000.

If you have to make changes to these authorization parameters, you must log off and log back on for the changes to take effect.

5.1 Activating DLB Plus Master Menu

Note

If you want DLB Plus to store sampled data in a specific directory without having to set your DEFAULT to that directory each time you SAMPLE a node, define a PROCESS or SYSTEM logical of "TTI_DLBP_DATA_LOC". For example:


  define/system TTI_DLBP_DATA_LOC "$1$DUA1:[MYNAME.DATA]" 

To start using the DLB Plus I/O analysis and reporting system, log onto the SYSTEM account:


    Username:  SYSTEM 
    Password:  (enter your system password) 

To activate the DLB Plus Master Menu, first create the directory in which you want the collected data to be stored and set your default to that directory. For example:


    $ CREATE/DIRECTORY $1$DISK1:[MYNAME.DATA] 
 
    $ SET DEFAULT $1$DISK1:[MYNAME.DATA]     <--- enter if no logical created 
 
    $ SET DEFAULT TTI_DLBP_DATA_LOC          <--- enter if logical created 

Then, enter the following command:


    $ @TTI_DLB:DLBPLUS 

When the DLB Plus I/O analysis and reporting system is activated, the DLB Plus package name, version number and copyright information are displayed. A few seconds later, the screen clears and the "DLB Plus" menu displays a message that DLB Plus is being initialized.

After DLB Plus has been initialized, the screen clears and the "Master Menu" is displayed.

Example 5-1 DLB Plus Master Menu


 DLB Plus                 Dynamic Load Balancer PLUS                01-Sep-1995 
 
 +-------------------------------Master Menu-------------------------------+ 
 |                                                                         | 
 |        Dynamic Load Balancing                 Reports and Graphs        | 
 |  DISPLAY   DLB Display Utility          SREPORTS  Summary Reports...    | 
 |  STATUS    DLB Status Utility           DREPORTS  Detail Reports...     | 
 |                                         GRAPHS    Analysis Graphs...    | 
 |             I/O Sampling                SCAN      Batch I/O Scan        | 
 |  SAMPLE    Sample I/Os and Locks                                        | 
 |  CONS      Consolidate Sample Data             Batch Procedures         | 
 |  SELECT    Select a Data Sample         BATCH     Set Up Batch Stream   | 
 |  LIVE      Live Lock Analysis...        SUBMIT    Submit Batch Stream   | 
 |  AUTO      Automatic Sampling           CANCEL    Cancel Batch Set Up   | 
 |                                                                         | 
 |                                                 Miscellaneous           | 
 |                                         SYSTEM    VMS System Commands   | 
 |                                         EXIT      EXIT DLB Plus         | 
 +-------------------------------------------------------------------------+ 
 
 
 
 
EXIT = Exit                                                \ = Back  HELP = Help

Select the SAMPLE item from the menu. The SAMPLE item collects I/O data by taking a snapshot of the images currently running on the node that you are logged onto (see Chapter 9, Sampling, Consolidating, and Selecting).

For each node to be analyzed, perform the following steps:

To obtain a complete picture of all I/O activity on all the nodes within a cluster, it is necessary to log onto each node, run DLB Plus from that node and collect I/O data for that node. DLB Plus must be run on each individual node because DLB Plus only samples data for the node on which it is being run.

After all the samples have been taken and the I/O data has been collected:


Chapter 6
Using DLB Plus

6.1 Master Menu

After the DLB Plus Master Menu is activated (see Section 5.1), various menu items can be selected.



 DLB Plus                 Dynamic Load Balancer PLUS                01-Sep-1995 
 
 +-------------------------------Master Menu-------------------------------+ 
 |                                                                         | 
 |        Dynamic Load Balancing                 Reports and Graphs        | 
 |  DISPLAY   DLB Display Utility          SREPORTS  Summary Reports...    | 
 |  STATUS    DLB Status Utility           DREPORTS  Detail Reports...     | 
 |                                         GRAPHS    Analysis Graphs...    | 
 |             I/O Sampling                SCAN      Batch I/O Scan        | 
 |  SAMPLE    Sample I/Os and Locks                                        | 
 |  CONS      Consolidate Sample Data             Batch Procedures         | 
 |  SELECT    Select a Data Sample         BATCH     Set Up Batch Stream   | 
 |  LIVE      Live Lock Analysis...        SUBMIT    Submit Batch Stream   | 
 |  AUTO      Automatic Sampling           CANCEL    Cancel Batch Set Up   | 
 |                                                                         | 
 |                                                 Miscellaneous           | 
 |                                         SYSTEM    VMS System Commands   | 
 |                                         EXIT      EXIT DLB Plus         | 
 +-------------------------------------------------------------------------+ 
 
 
 
 
EXIT = Exit                                                \ = Back  HELP = Help

The Master Menu contains the items that:

6.2 Choosing Menu Items

Menu ITEMS are listed on the menu screen. Each item has a short description of what the item does.

To choose a given item, use the "arrow" keys to locate the item to be executed.

An item can also be selected by entering the item name. You need enter only as many characters of the item name as is necessary to distinguish the item from all other items on the menu.

6.3 On-line Help System

Extensive HELP is always available. To get help at any time, just enter the word HELP or press the [HELP] key when prompted for information. The screen clears and the help text is displayed.

In some cases you will be presented with a list of related help topics. If you are interested in a related topic, just enter the topic name when prompted for "Help topic?".

When you have finished getting the help that you need, press [RETURN] or enter EXIT at the "Help topic?" prompt. The help screen is replaced with the screen that was displayed before HELP was entered.

6.4 Exiting and Going Back to Correct Mistakes

From anywhere within DLB Plus you can EXIT the current procedure or go BACK to a previous prompt or menu.

Type the word EXIT or [CTRL/Z] to get out of a menu item, go back to a previous menu, or exit out of DLB Plus altogether (if at the Master Menu).

Press the "\" key to back up to the previous prompt or go back to a previous menu.

You can also use the EXIT DLB Plus menu option to exit out of the DLB Plus Master Menu.

6.5 Aborting a Report or Graph

To abort a report or graph while it is being created, press [CTRL/Z] at any time. You will be returned to a menu screen.

6.6 OpenVMS System Commands Option

Selecting SYSTEM from the Master Menu, allows you to get to the system prompt without having to exit out of DLB Plus.

When this menu item is selected, the screen clears and you are placed at the system prompt. The screen appears as follows:



 
 
 
 
 
 
 
 
Passed to VMS...  Press  RETURN  when done.  TESTDISK:[DLB_TESTER] 
$$ X

The double dollar sign ($$) reminds you that you got to the system prompt from DLB Plus.

When you want to return to the DLB Plus Master Menu, press the [Return] key at the $$ prompt and the menu will be displayed.


Chapter 7
Using the DLB Plus DISPLAY Utility

DLB Plus DISPLAY provides a view of the operation of an OpenVMS system through a dynamic display. It shows the system load graphically on any VTxxx-compatible terminal. Use DLB Plus DISPLAY to compare before and after effects of changes made to the system. Use it as a data source for system analysis. The following values are displayed:

The following section describes DLB Plus DISPLAY and its operation.

7.1 Running DLB Plus DISPLAY

To run DLB Plus DISPLAY, select the DISPLAY menu item from the DLB Plus Master Menu.

The screen clears and you see the display graph. The graph remains blank for one sample interval while statistics are gathered. Then, the screen is updated at 10 second intervals.

Note

To refresh the DLB Plus DISPLAY screen, type [CTRL/W]. This keystroke refreshes the static portions of the display. It can be useful to refresh the display after certain operations, such as after resizing a VAX workstation window.

To write the DLB Plus DISPLAY screen to a file, press [CTRL/F] while DLB Plus DISPLAY is running. This creates a file called SYS$SCRATCH:DLBPLUS_DISPLAY.LIS which is a snapshot of the current DLB Plus DISPLAY screen.

Type [CTRL/Z] to terminate DLB Plus DISPLAY.

Example 7-1 DLB Plus DISPLAY

                           DLB DISPLAY Utility Vx.x              Total Samples : 135                
                                                              Sample Intvl (s) :  10 
(1)Node Name : TTI 
VMS Version : Vx.x 
DLB Version : DLB Vx.x                          (2)Response Index Graphic History     
DLB Op Mode : Ample       |    Cur RI (%):   95         AV RI (%) :  81 
                          |100| 
(3)Total Sys Memory: 65536 | 90|                 X X X     X           X X                 
Free List Size    : 48902 | 80|             X   X X X X   X     X X   X X   X     X X 
% on Free List    :    74 | 70|             X X X X X X X X   X X X   X X   X X X X X       
Modified Lst Size :  3108 | 60|         X   X X X X X X X X X X X X X X X X X X X X X 
% of MPW_HILIMIT  :    31 | 50|         X   X X X X X X X X X X X X X X X X X X X X X        
Memory Recovered  :   132 | 40|   X   X X   X X X X X X X X X X X X X X X X X X X X X       
                          | 30| X X X X X X X X X X X X X X X X X X X X X X X X X X X 
(4)Hard Fault / Sec:     0 | 20| X X X X X X X X X X X X X X X X X X X X X X X X X X X 
Soft Fault / Sec  :    11 | 10| X X X X X X X X X X X X X X X X X X X X X X X X X X X 
HF / TotFlts %    :     0 + - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 
                                (5)QUANTUM :  10      (6)# COM PROCS   :  1 
(7)DLB Cycles Compl:   817         IOTA    :   2        # PFW Procs   :  0       
Total Proc Adj    :     4                              Total # Procs : 20 
Memory Adj Cutoff :  3599   (8)DLB Bias        :  NONE  Ker+Int+MPSyn%: 15 
# Inactive Proc   :     0     Only mem        :  No    Idle Time %   : 30 
Adj Last Cycle    :     0     Only CPU        :  No    Dir IO / Sec  :  1 
Eval Last Cycle   :    20     Adjust All      :  No    Images Activd :  1 

7.2 The DISPLAY Screen

Example 7-1 illustrates the DLB Plus DISPLAY screen after 135 iterations. The header displays the current DLB Plus DISPLAY version number. The last item on the first line is the number of system samples that have occurred since DLB Plus DISPLAY was started.

The response index is an indicator of the overall responsiveness of the system to the CPU and I/O needs of processes. The graph represents the response index over the last 45 sampling intervals. By examining this graph, the user can view system trends. Assuming only moderate new process activity and an active DLB Plus process, the graph shows an increasing response index until a steady state has been achieved.

7.2.1 DLB Plus DISPLAY information

  1. DLB Plus status information. Included is information on:
  2. Response Index Graphic History. This shows response index measured in percentages. DLB Plus uses a vertical bar chart with values of 0 to 100%, each bar representing a sample:
  3. System memory information:
  4. Information on page faults:
  5. Checks critical SYSGEN parameters:
  6. OpenVMS statistics:
  7. DLB Plus work cycle information:
  8. DLB Plus performance options:

7.3 Understanding the DLB Plus Response Index

The response index is an indicator of the overall responsiveness of the system to the CPU and I/O needs of processes.

The response index indicates the following about a system:

Use the response index as a relative indicator of system responsiveness. Pay more attention to the changes that span multiple samplings rather than individual sample values. There are several trends that warrant investigation to determine why the system responsiveness is changing; they are:

Note

The response index is not intended to be used as a measure of response time for processes on a system, and is thus not very useful for comparing one system with another, nor can it compare different loads on the same system.

The response index is calculated using the following performance factors:

The number of processes in COM state is an element of the CPU load on the system. The more processes in the COM state, the longer a given process has to wait to get to use the CPU. In addition, system overhead is incurred due to scheduling activity, quantum end events, etc.

PFW state is a state that a process is in, due to its having to wait to have a page fault resolved by OpenVMS. The more processes in PFW state, the longer a process has to wait to get a page fault resolved. In addition, system overhead is incurred as a result of servicing the page faults.

CPU activity spent in system overhead keeps user processes from executing. This slows down the system.

Hard page faults cause CPU resources to be consumed. In addition, hard page faults cause I/O operations, further slowing down the system.

Direct I/O operations incur system overhead and can cause delays for other processes accessing disks.

Idle time is the reserve capacity the system has to handle processes needing CPU time. The more idle time, the more reserve capacity and thus the greater probability that processes will run quickly.

Free pages are the reserve capacity the system has to handle memory demands. The more free pages, the more reserve capacity. Having more free pages tends to reduce hard page faults.


Previous Next Contents Index