Sign In to Follow Application
View All Documents & Correspondence

System Resource Management Method For Virtual System

Abstract: Specifically, a service menu is set for each user ID to determine an information distribution range/distribution amount according to the service menu. Finally, “vendor lock-in” of an infrastructure (hardware, software) is a typical alternative to be adopted to provide high-quality service at low cost. Pieces of information obtained for each product are temporarily collected, are categorized at the same level into pieces of information for respective purposes (screens), and then are provided for users. This achieves proper capacity planning.

Get Free WhatsApp Updates!
Notices, Deadlines & Correspondence

Patent Information

Application #
Filing Date
23 October 2013
Publication Number
43/2014
Publication Type
INA
Invention Field
COMPUTER SCIENCE
Status
Email
ipo@knspartners.com
Parent Application

Applicants

HITACHI, LTD.
6-6, Marunouchi 1-chome, Chiyoda-ku, Tokyo 100-8280, Japan

Inventors

1. Hideyuki KATO
c/o Hitachi, Ltd., 6-6, Marunouchi 1-chome, Chiyoda-ku, Tokyo 100-8280 Japan
2. Kenji SUMII
c/o Hitachi, Ltd., 6-6, Marunouchi 1-chome, Chiyoda-ku, Tokyo 100-8280 Japan
3. Shingo WAKAMATSU
c/o Hitachi, Ltd., 6-6, Marunouchi 1-chome, Chiyoda-ku, Tokyo 100-8280 Japan
4. Yuki TATEISHI
c/o Hitachi, Ltd., 6-6, Marunouchi 1-chome, Chiyoda-ku, Tokyo 100-8280 Japan
5. Kohei YOSHIKAWA
c/o Hitachi, Ltd., 6-6, Marunouchi 1-chome, Chiyoda-ku, Tokyo 100-8280 Japan
6. Akinao HARADA
c/o Hitachi, Ltd., 6-6, Marunouchi 1-chome, Chiyoda-ku, Tokyo 100-8280 Japan
7. Kazuhiko WATANABE
c/o Hitachi, Ltd., 6-6, Marunouchi 1-chome, Chiyoda-ku, Tokyo 100-8280 Japan
8. Kengo YAMATANI
c/o Hitachi Systems, Ltd., 1-2-1, Osaki, Shinagawa-ku, Tokyo 141-8672, Japan

Specification

CLIAMS:1 A system resource management method for a virtual system used by a plurality of users,
the method comprising:
recording a service menu of service available to each of the users, for each of user IDs that identify the respective users;
recording information on the virtual system according to the service menu in association with a range and amount of the available information;
collecting the information available to the user according to corresponding recorded contents, in response to an information service request from the user; and
outputting the collected information to the user.

2. The system resource management method for a virtual system according to claim 1, wherein the information on the virtual system is generated for each of products constituting the virtual system, and
In the collecting, the information on each of the products is collected for each of the users.

3. The system resource management method for a virtual system according to claim 1 or 2, wherein the collected information is outputted after being categorized into pieces of information for respective purposes.

4. The system resource management method for a virtual system according to claim 3, wherein the information for the respective purposes is associated with screens available in use of the virtual system.
,TagSPECI:BACKGROUND OF THE INVENTION

The present invention relates to system management for a virtual environment and particularly relates to a technique of managing the allocation of resources.
A virtual technique enables abstraction of physical resources (including a server, a storage, a CPU, and a memory) constituting a computer system, achieving a virtual system flexibly configured for each logical resource. Based on this technique, service for usage-based billing can be provided according to a resource usage rate.
In this case, virtual system management requires access authority or specified administrative authority. Regarding the administrative authority, Japanese Patent Laid-Open No. 2005-208999 discloses administrative authority set for the resource of a virtual machine by determining information on the presence or absence of administrative authority for each virtual resource ID. This allows a user to manage the resource of the virtual machine.

SUMMARY OF THE INVENTION

A virtual system is composed of various products. The level (range or depth) of obtained operation information varies among the products. Thus, a user and a system administrator of the virtual system have to process the operation information, assuming that future system users will be unable to process system configurations that are expected to become more complicated or physical resources and logical resources that are expected to increase in a virtual environment.
Operation management software and individual products (hardware, OS, database, etc.) are linked with each other to provide the functions of transversely and efficiently managing resources for virtual system users. Specifically, a service menu is set for each user ID to determine an information distribution range/distribution amount according to the service menu. Finally, “vendor lock-in” of an infrastructure (hardware, software) is a typical alternative to be adopted to provide high-quality service at low cost. Pieces of information obtained for each product are temporarily collected, are categorized at the same level into pieces of information for respective purposes (screens), and then are provided for users. This achieves proper capacity planning that suppresses system investment more than in other companies, leading to reliability that will increase orders in the future.
Precise system monitoring for a request and a flexible resource management function allow a user to prevent a mechanical loss caused by a resource shortage and further optimize system usage cost. Moreover, system administrators and operators can reduce running cost.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows an example of devices/functions according to an embodiment of the present invention.
FIG. 2 shows an example of a management target according to the embodiment of the present invention.
FIG. 3 shows a company management table used in the embodiment of the present invention.
FIG. 4 shows a user management table used in the embodiment of the present invention.
FIG. 5 shows a center management table used in the embodiment of the present invention.
FIG. 6 shows a physical node management table used in the embodiment of the present invention.
FIG. 7 shows a machine specification management table used in the embodiment of the present invention.
FIG. 8 shows a job management table used in the embodiment of the present invention.
FIG. 9 shows a virtual node management table 1 used in the embodiment of the present invention.
FIG. 10 shows a virtual node management table 2.
FIG. 11 shows a virtual node management table 3 used in the embodiment of the present invention.
FIG. 12 shows a service management table used in the embodiment of the present invention.
FIG. 13 shows an operation statistic management table used in the embodiment of the present invention.
FIG. 14 shows a system management table used in the embodiment of the present invention.
FIG. 15 shows a system change/management table used in the embodiment of the present invention.
FIG. 16 shows an operation history management table used in the embodiment of the present invention.
FIG. 17 shows an operation status acquisition flow according to the embodiment of the present invention.
FIG. 18(1) shows an operation status confirmation (edit) flow (1) according to the embodiment of the present invention.
FIG. 18(2) shows an operation status confirmation (edit) flow (2) according to the embodiment of the present invention.
FIG. 19 shows a simulation (resource addition) flow according to the embodiment of the present invention.
FIG. 20 shows a simulation (job addition) flow according to the embodiment of the present invention.
FIG. 21 shows a dynamic monitoring (response) flow according to the embodiment of the present invention.
FIG. 22 shows a dynamic monitoring (resource) flow according to the embodiment of the present invention.
FIG. 23 shows a dynamic monitoring (operation conditions) flow according to the embodiment of the present invention.
FIG. 24 shows a static monitoring flow according to the embodiment of the present invention.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

An embodiment of the present invention will be specifically described below with reference to the accompanying drawings. The embodiment is merely exemplary and thus the present invention is not limited to the following contents.
? Device/function configuration :
FIG. 1 shows devices and functions according to the embodiment of the present invention. A center management node <101> is a node used by an administrator of an overall virtual system. The center management node <101> has the functions of user management <1011>, node management (physical/virtual) <1012>, job execution/management <1013>, failure detection/notification <1014>, operation status analysis/accumulation <1015>, operation status prediction <1016>, and system change/management (physical/virtual) <1017>. Furthermore, provided management tables include a company management table <1018>, a user management table <1019>, a center management table <1020>, a physical node management table <1021>, a machine specification management table <1022>, a job management table <1023>, virtual node management tables 1 to 3 <1024>, a service management table <1025>, an operation statistic management table <1026>, and a system change management table <1027>.
A virtual node <1031> is a system unit that is created based on a virtual resource and has the functions of job execution <1032> and operation status acquisition/notification <1033>. A virtual node <1034> is created across a physical node <103> and a physical node <105>.
For a virtual system user, a user management node <107> has the functions of system management <1071>, system change/management (virtual/system) <1072>, operation status analysis result reference <1073>, and operation status prediction result reference <1074>, a system management table <1075>, and a system change/management table <1076>.
Main processing in the present invention will be described below. Before the processing is started, values are set for the parameters of the company management table , the user management table , the center management table , the physical node management table , the machine specification management table , the job management table , the virtual node management tables 1 to 3 , the service management table , the operation statistic management table , the system management table , the system change/management table , and an operation history management table .
? Operation status acquisition :
An operation status is typically acquired at intervals of 60 seconds for 24 hours. The intervals can be changed by setting a data and time or an interval in an acquisition level <608> in the physical node management table and an acquisition level <1009> in the virtual node management table 2 .
At this point, the center management node specifies a target physical node ID <904> as an information acquisition target from the virtual node management table 1 based on a physical node ID <602> of or a virtual node ID <1003> of the virtual node management table 2.
An operation information acquisition command is issued according to the information acquisition level <1309, 1311, 1313> and the management range <1302, 1303, 1304, 1305, 1306, 1307> of the specified node as has been set in the operation statistic management table . Moreover, a processing (job) unit <201>, a work (job group) unit <202>, a product (system) unit <203>, a virtual OS unit <204>, and a physical server unit <205> can be specified in a management target . Moreover, an operation status is typically acquired at intervals of 60 seconds for 24 hours but may be acquired at varying intervals.
An operation status is acquired for a target node according to the conditions specified in and notifies the center management node of the result .
The center management node stores information received from the target node, in a CPU usage ratio <1310>, a memory usage ratio <1312>, a job processing time <1314>, and a response time <1315> of the operation statistic management table .
? Operation status confirmation (edit) :
A system user acquires a virtual node ID <1404> associated with a user code <1402> from the system management table of the user management node, and issues an operation status acquisition command for the node . Operation statistics are edited in confirmation units specified at the time of the acquisition command in the center management node .
(1) If the confirmation unit is a job ID, the CPU usage ratio <1310> and the memory usage ratio <1312> of a corresponding job are acquired from the operation statistic management table .
(2) If the confirmation unit is a job group ID, job IDs <1307> included in a target job group <1306> are extracted from the operation statistic management table . The CPU usage ratio <1310> and the memory usage ratio <1312> are acquired for each of the extracted job IDs and then are summed .
(3) If the confirmation unit is a system ID, the CPU usage ratio <1310> and the memory usage ratio <1312> of a job corresponding to a system ID <1305> specified from the operation statistic management table are acquired and summed . (4) If the confirmation unit is a virtual node ID, the CPU usage ratio <1310> and the memory usage ratio <1312> of a job corresponding to a virtual node ID <1304> specified from the operation statistic management table are acquired and summed .
(5) If the confirmation unit is a physical node ID, the following processing is performed. For example, processing for a specified physical node A <103> in Figure 1 will be described below .
(A) The CPU usage ratio of a job included in a virtual node A <1031> is determined as described in (4).
(B) The CPU usage ratio of a job included in a virtual node B <1034> is determined as described in (4). An acquisition range is however limited to static information containing the physical node ID <1304> corresponding to the physical node A.
(C) A CPU <604> and a memory <605> of the physical node A are acquired from the physical node management table . A CPU <905> and a memory <906> that are allocated to the virtual node A and the virtual node B on the physical node A are acquired from the virtual node management table 1 .
(D) The CPU usage ratio and the memory usage ratio of the physical node A are calculated by the following equations.
? CPU usage ratio = (A) ? (the CPU of the virtual node A/the CPU of the physical node A) + (B) ? (the CPU of the virtual node B on the physical node A/the CPU of the physical node A)
? Memory usage ratio = (A) ? (the memory of the virtual node A/the memory of the physical node A) + (B) ? (the memory of the virtual node B on the physical node A/the memory of the physical node A)
Operation information calculated in steps <1803, 1804, 1805, 1806, 1807> is displayed on a user management node according to the confirmation unit specified from the user management node . The center management node can also calculate past information in the same confirmation unit. Information on past periods indicated by the user management node can be obtained from the operation statistic management table , and a minimum value, a maximum value, and a mean value in a target period can be calculated and displayed.
? Simulation (resource addition) :
A system ID <1403> or a virtual node ID <1404> is specified from the system management table in the user management node to specify a resource addition target. An operation status of a managed system is confirmed in the preceding operation status confirmation (edit) , and then a resource (CPU, memory) to be added is specified. After that, a simulation command or a resource addition command is issued. The center management node stores specified addition resource information in each column of the system change/management table .
Resource addition instructions are classified to perform the following processing .
(1) Simulation
In the center management node, performance information (a CPU usage ratio <1310>, a memory usage ratio <1312>, a job processing time <1314>, and a response time <1315>) is acquired from the operation statistic table before and after the resource of a target system is changed, and then a performance ratio is calculated .
(A) Throughput (the number of jobs or the number of transactions processed per unit time)
? Throughput = (CPU performance ? usage ratio)/the number of steps in each transaction
The CPU performance is obtained from performance <706> of the machine specification management table while the usage ratio is obtained from a CPU usage ratio <1310> of the operation statistic management table . For the number of steps in each transaction, a numerical value for simulation is set beforehand according to a processing pattern. After a resource is added, a CPU usage ratio is calculated by dividing used CPU by (allocated CPU + added CPU).
(B) Response (a time from the end of input to the start of output)
Response is obtained from a response time <1315> of the operation statistic management table . After a resource is added, a response time is calculated by multiplying a response time <1315> by (a throughput after a resource is added ? a throughput before a resource is added). It is assumed that if an I/F (LAN, FC) is not added, a job processing time after the addition of a resource (from job entry to the end of output) is equivalent to response after the resource of added.
Additionally, after a resource is added, a memory usage ratio is calculated by dividing used memory by (allocated memory + added memory) .
(2) System change (resource addition)
Additional resource information is stored in a CPU <1505> and a memory <1506> in the system change/management table of the center management node. The resource of the target node is changed in a date and time <1507> specified in the center management node. Changed information is reflected to each column of the virtual node management table 1 .
A simulation result or a resource addition result is displayed on the user management node .
Simulation (job addition) :
A developed machine previously executes an additional job to acquire performance information. Information is stored in a job ID <802>, a CPU <810>, and a memory <811> that are the items of the job management table in the center management node .
The system ID <1403> is specified from the system management table in the user management node to specify an additional job. The system ID and the additional job are specified in the user management node, and then a simulation command or a job addition command is issued .
Job addition instructions are classified to perform the following processing .
(1) Simulation
In the center management node, the performance ratio of performance information on the developed machine and a real-operational environment node having the additional job is obtained and calculated from the table below . (The CPU <604> and the memory <605> in the physical node management table or the allocated CPU <905>, the allocated memory <906>, and an IF <911> in the virtual node management table 1 ).
In the center management node, the CPU usage ratio <1310> and the memory usage ratio <1312> of the operation statistic management table are obtained as an operation status of the target system before a job is added, and then a CPU usage ratio, a memory usage ratio, and a job processing time with the additional job are calculated from the performance ratio obtained in step 1903 .
? Simulation value = a performance value before the addition of a resource + a performance value on the developed machine ? the performance ratio of the developed machine and the real-operational environment node
The performance value indicates a CPU usage ratio, a memory usage ratio, and a job processing time.
(2) System change (job addition)
Resource information to be added to the CPU <1505> and the memory <1506> of the system change/management table is stored in the center management node. The resource of the target node is changed in the date and time <1507> specified in the center management node. Changed information is reflected to each column of the virtual node management table 1 .
A simulation result or a resource addition result is displayed on the user management node .:
? Dynamic monitoring (response) :
The center management node compares a time in the response time <1315> obtained at a date and time <1308> in the operation statistic management table and a response threshold value <910> in the virtual node management table 1 .
In the case of operation information within the threshold value, monitoring is completed . If the operation information exceeds the threshold value, information is stored in a user code <1602>, a system ID <1603>, a virtual node ID <1604>, a job ID <1605>, a warning type <1606>, and a date and time <1608> in the operation history management table . The user management node is notified of contents stored in .
Dynamic monitoring (resource) :
The center management node obtains information in the date and time <1308> of the operation statistic management table and stores values in the CPU usage ratio <1310> and the memory usage ratio <1312>. The obtained value is compared with a CPU threshold value <907> and a memory threshold value <908> in the virtual node management table 1 .
In the case of operation information within the threshold value, monitoring is completed .
If the operation information exceeds the threshold value, automatic allocation flags <1405, 1409> in the system management table of the center management node are confirmed .
In modes other than an automatic resource allocation mode, information is stored in the user code <1602>, the system ID <1603>, the virtual node ID <1604>, the job ID <1605>, the warning type <1606>, and the date and time <1608> in the operation history management table . The user management node is notified of the contents .
In the automatic resource allocation mode, an available resource in the physical node to be operated is calculated with reference to the two tables (the CPU usage ratio <1310> and the memory usage ratio <1312> in the operation statistic management table and the CPU <604> and the memory <605> in the physical node management table ) .
An allocated CPU <1408> and an allocated memory <1412> that are set beforehand in the system management table are added to the target node. However, the limit of an allocated capacity is equal to or lower than the free space determined in and an automatic allocation upper limit <1407, 1411> in the system management table . An automatically allocated resource is released after an allocated period <1406, 1410> in the system management table .
Information on automatic allocation records is stored in automatic allocation <1607> and the date and time <1608> in the operation history management table , and then the user management node is notified of the contents .
? Automatic monitoring (operation conditions)
The center management node confirms the operation status of a monitored job under conditions specified by a starting condition <806>, a starting time <807>, an end time <808>, and an earliest starting time <809> in the job management table . When the operation conditions are satisfied, the monitoring is completed .
If the operation conditions are not satisfied, information is stored in the user code <1602>, the system ID <1603>, the virtual node ID <1604>, the job ID <1605>, the warning type <1606>, and the date and time <1608> in the operation history management table . The user management node is notified of the contents stored in .
? Static monitoring :
To confirm inconsistency between the management tables of the center management node and the user management node, the two tables are compared with each other (a system ID <1102> and a virtual node ID <1104> in the virtual node management table 3 and the system ID <1403> and the virtual node ID <1404> in the system management table of the user management node) at a consistency confirmation date and time <1105> in the virtual node management table 3 .
Information on the comparison result is stored in a consistency confirmation result <1106> in the virtual node management table 3 . In the case of consistency, static monitoring is completed . In the case of inconsistency, the center management node and the user management node are notified of inconsistency information . Furthermore, a virtual system administrator carries out research and measures .

Documents