Skip to content
forked from ltpinellia/vsphere

适用于Open-Falcon的VSphere监控agent

Notifications You must be signed in to change notification settings

kk-kwok/vsphere

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

falcon-vsphere

这是一个适用于Open-Falcon的,用于监控Vsphere及由Vsphere监管的所有esxi性能指标的agent。

一.特性

  1. 支持多vsphere同时采集
  2. 支持vsphere与esxi监控项归并/拆分,支持自定义endpoint或监控项头部
  3. esxi监控项包含基础跟扩展监控两类,扩展监控可选择启用,同时可配置
  4. 配置支持热加载(例如添加vsphere,增删扩展监控等等),不需要重新启动agent

二.使用说明

  1. export WorkDir="$HOME/falcon-vsphere"
  2. mkdir -p $WorkDir
  3. tar -xzvf falcon-vsphere-x.x.x.tar.gz -C $WorkDir
  4. cd $WorkDir
  5. ./control start

三.配置说明

{
    "debug": false,
    "extend": "./extend.json",              #扩展监控项列表     
    "heartbeat": { 
        "enabled": true,                    #是否开启HBS
        "addr": "127.0.0.1:6030",           #HBS地址
        "timeout": 1000                     #超时时间
    },
    "transfer": {
        "enabled": true,                    #是否开启Transfer
        "addrs": [              
            "127.0.0.1:8433"                #Transfer地址,可配置多个
        ],
        "interval": 60,                     #上传时间间隔
        "timeout": 1000                     #超时时间          
    },
    "vsphere": [
        {
            "hostname": "VC-1.1.1.1",       #上传的Vsphere Endpoint名
            "ip": "1.1.1.1",                #Vsphere IP
            "addr": "https://1.1.1.1/sdk",  #Vsphere SDK地址
            "user":"vsphere1-user",         #Vsphere 用户名
            "pwd":"vsphere1-pwd",           #Vsphere 密码
            "port": 443,                    #Vsphere 端口
            "split": true,                  #是否切分。如果选择true,那么VC将与esxi的监控项单独出来,esxi将作为单独的endpoint上传,esxi endpoint名称可以添加endpointhead作为头部;如果选择false,那么esxi的监控项将与VC的监控项合并上传,esxi的监控项会收集在VC下面作为一个大的endpoint,此时可以选择配置metrichead作为esxi的监控项头部,以区分vc跟esxi的监控项
            "endpointhead": "ESXI-",        #在split为true时生效,作为esxi endpoint的扩展头部,可为空
            "metrichead":"esxi.",           #在split为false时生效,作为esxi监控项的扩展头部,以区分vc的监控项跟esxi的监控项
            "extend": true                  #是否启用扩展监控
        },
        {
            "hostname": "VC-2.2.2.2",
            "ip": "2.2.2.2",
            "addr": "https://2.2.2.2/sdk",
            "user":"vsphere2-user",
            "pwd":"vsphere2-pwd",
            "port": 443,
            "split": true,
            "endpointhead": "ESXI-",
            "metrichead":"esxi.",
            "extend": true
        }
    ]
}

四.监控项说明

  1. 基础监控项
监控项名称 说明
agent.alive 默认上传1
agent.power 1:关机;2:开机;3:待机;4:未知,表示主机断开连接或者无响应
agent.status 1:状态未知;2:实体没问题;3:实体肯定有问题;4:实体可能有问题
agent.uptime 开机时间
cpu.busy cpu使用百分比
cpu.free.average cpu空闲(单位:HZ)
cpu.total cpu总量(单位:HZ)
cpu.usage.average cpu使用(单位:HZ)
datastore.totalReadLatency.average Average amount of time for a read operation from the datastore. Total latency = kernel latency + device latency.
datastore.totalWriteLatency.average Average amount of time for a write operation to the datastore. Total latency = kernel latency + device latency.
df.bytes.free.percent 存储器空闲比例(单块存储器)
df.bytes.free 存储器空余量(单块存储器)
df.bytes.total 存储器总量(单块存储器)
df.bytes.used.Percent 存储器使用率(单块存储器)
df.bytes.used 存储器使用量(单块存储器)
df.statistics.free 存储器空余量(所有存储器)
df.statistics.free.percent 存储器空闲比例(所有存储器)
df.statistics.total 存储器总量(所有存储器)
df.statistics.used 存储器使用量(所有存储器)
df.statistics.used.percent 存储器使用率(所有存储器)
mem.memfree 内存空闲量
mem.memfree.percent 内存空闲百分比
mem.memtotal 总内存
mem.memused 内存使用量
mem.memused.percent 内存使用百分比
net.bytesRx.average Average amount of data received per second.
net.bytesTx.average Average amount of data transmitted per second.
  1. 扩展监控项

类型声明

NAME DESCRIPTION
absolute Represents an actual value, level, or state of the counter. For example, the “uptime” counter (system group) represents the actual number of seconds since startup. The “capacity” counter represents the actual configured size of the specified datastore. In other words, number of samples, samplingPeriod, and intervals have no bearing on an “absolute” counter“s value.
delta Represents an amount of change for the counter during the samplingPeriod as compared to the previous interval. The first sampling interval
rate Represents a value that has been normalized over the samplingPeriod, enabling values for the same counter type to be compared, regardless of interval. For example, the number of reads per second.
  • hbr类监控项
监控项名称 描述 类型
hbr.hbrNetTx.average Average amount of data transmitted per second rate
hbr.hbrNetRx.average Kilobytes per second of outgoing host-based replication network traffic (for this virtual machine or host) rate
hbr.hbrNumVms.average Number of powered-on virtual machines running on this host that currently have host-based replication protection enabled. absolute
  • rescpu类监控项
监控项名称 描述 类型
rescpu.runav15.latest CPU running average over 15 minutes absolute
rescpu.actav1.latest CPU running average over 1 minute absolute
rescpu.actpk15.latest CPU active peak over 15 minutes absolute
rescpu.actav5.latest CPU running average over 5 minutes absolute
rescpu.runpk1.latest CPU running peak over 1 minute absolute
rescpu.maxLimited5.latest Amount of CPU resources over the limit that were refused, average over 5 minutes absolute
rescpu.actpk1.latest CPU active peak over 1 minute absolute
rescpu.sampleCount.latest Group CPU sample count. absolute
rescpu.samplePeriod.latest Group CPU sample period. absolute
rescpu.maxLimited1.latest Amount of CPU resources over the limit that were refused, average over 1 minute absolute
rescpu.runpk15.latest CPU running peak over 15 minutes absolute
rescpu.maxLimited15.latest Amount of CPU resources over the limit that were refused, average over 15 minutes absolute
rescpu.runav5.latest CPU running average over 5 minutes absolute
rescpu.runav1.latest CPU running average over 1 minute absolute
rescpu.actav15.latest CPU active average over 15 minutes absolute
rescpu.actpk5.latest CPU active peak over 5 minutes absolute
rescpu.runpk5.latest CPU running peak over 5 minutes absolute
  • storagePath类监控项
监控项名称 描述 类型
storagePath.totalReadLatency.average Average amount of time for a read issued on the storage path. Total latency = kernel latency + device latency. absolute
storagePath.commandsAveraged.average Average number of commands issued per second on the storage path during the collection interval rate
storagePath.numberReadAveraged.average Average number of read commands issued per second on the storage path during the collection interval rate
storagePath.write.average Rate of writing data on the storage path rate
storagePath.maxTotalLatency.latest Highest latency value across all storage paths used by the host absolute
storagePath.read.average Rate of reading data on the storage path rate
storagePath.numberWriteAveraged.average Average number of write commands issued per second on the storage path during the collection interval rate
storagePath.totalWriteLatency.average Average amount of time for a write issued on the storage path. Total latency = kernel latency + device latency. absolute
  • storageAdapter类监控项
监控项名称 描述 类型
storageAdapter.read.average Rate of reading data by the storage adapter rate
storageAdapter.numberReadAveraged.average Average number of read commands issued per second by the storage adapter during the collection interval rate
storageAdapter.numberWriteAveraged.average Average number of write commands issued per second by the storage adapter during the collection interval rate
storageAdapter.totalWriteLatency.average Average amount of time for a write operation by the storage adapter. Total latency = kernel latency + device latency. absolute
storageAdapter.totalReadLatency.average Average amount of time for a read operation by the storage adapter. Total latency = kernel latency + device latency. absolute
storageAdapter.commandsAveraged.average Average number of commands issued per second by the storage adapter during the collection interval rate
storageAdapter.write.average Rate of writing data by the storage adapter rate
  • power类监控项
监控项名称 描述 类型
power.power.average Current power usage. rate
power.powerCap.average Maximum allowed power usage. rate
power.energy.summation Total energy used since last stats reset. rate
  • sys类监控项
监控项名称 描述 类型
sys.resourceCpuUsage.average Amount of CPU used by the Service Console and other applications during the interval by the Service Console and other applications. rate
sys.resourceCpuUsage.maximum Amount of CPU used by the Service Console and other applications during the interval by the Service Console and other applications. rate
sys.resourceMemMapped.latest Memory mapped by the system resource group absolute
sys.resourceMemAllocShares.latest Memory allocation shares of the system resource group absolute
sys.resourceMemCow.latest Memory shared by the system resource group absolute
sys.resourceMemShared.latest Memory saved due to sharing by the system resource group absolute
sys.resourceCpuAct1.latest CPU active average over 1 minute of the system resource group absolute
sys.resourceCpuUsage.minimum Amount of CPU used by the Service Console and other applications during the interval by the Service Console and other applications. rate
sys.resourceMemZero.latest Zero filled memory used by the system resource group absolute
sys.resourceCpuAct5.latest CPU active average over 5 minutes of the system resource group absolute
sys.resourceCpuMaxLimited1.latest CPU maximum limited over 1 minute of the system resource group absolute
sys.resourceFdUsage.latest Number of file descriptors used by the system resource group absolute
sys.resourceMemConsumed.latest Memory consumed by the system resource group absolute
sys.resourceCpuRun5.latest CPU running average over 5 minutes of the system resource group absolute
sys.uptime.latest Total time elapsed, in seconds, since last system startup absolute
sys.resourceCpuAllocMin.latest CPU allocation reservation (in MHz) of the system resource group absolute
sys.resourceCpuAllocShares.latest CPU allocation shares of the system resource group absolute
sys.resourceCpuMaxLimited5.latest CPU maximum limited over 5 minutes of the system resource group absolute
sys.resourceMemOverhead.latest Overhead memory consumed by the system resource group absolute
sys.resourceCpuUsage.none Amount of CPU used by the Service Console and other applications during the interval by the Service Console and other applications. rate
sys.resourceCpuRun1.latest CPU running average over 1 minute of the system resource group absolute
sys.resourceMemSwapped.latest Memory swapped out by the system resource group absolute
sys.resourceMemTouched.latest Memory touched by the system resource group absolute
sys.resourceMemAllocMax.latest Memory allocation limit (in KB) of the system resource group absolute
sys.resourceMemAllocMin.latest Memory allocation reservation (in KB) of the system resource group absolute
  • net类监控项
监控项名称 描述 类型
net.unknownProtos.summation Number of frames with unknown protocol received during the sampling interval delta
net.packetsRx.summation Total number of packets received on all virtual machines running on the host. delta
net.received.average The rate at which data is received across each physical NIC instance on the host. rate
net.errorsRx.summation Number of packets with errors received during the sampling interval. delta
net.transmitted.average The rate at which data is transmitted across each physical NIC instance on the host. rate
net.usage.none Sum of data transmitted and received across all physical NIC instances connected to the host. rate
net.bytesRx.average Average amount of data received per second. rate
net.multicastTx.summation Number of multicast packets transmitted during the sampling interval. delta
net.droppedTx.summation Number of transmitted packets dropped during the collection interval. delta
net.usage.minimum Sum of data transmitted and received across all physical NIC instances connected to the host. rate
net.multicastRx.summation Number of multicast packets received during the sampling interval. delta
net.bytesTx.average Average amount of data transmitted per second. rate
net.broadcastRx.summation Number of broadcast packets received during the sampling interval. delta
net.packetsTx.summation Number of packets transmitted across each physical NIC instance on the host. delta
net.errorsTx.summation Number of packets with errors transmitted during the sampling interval. delta
net.broadcastTx.summation Number of broadcast packets transmitted during the sampling interval. delta
net.usage.maximum Sum of data transmitted and received across all physical NIC instances connected to the host. rate
net.usage.average Sum of data transmitted and received across all physical NIC instances connected to the host. rate
net.droppedRx.summation Number of received packets dropped during the collection interval. delta
  • disk类监控项
监控项名称 描述 类型
disk.usage.maximum Aggregated disk I/O rate. For hosts, this metric includes the rates for all virtual machines running on the host during the collection interval. rate
disk.deviceWriteLatency.average Average amount of time, in milliseconds, to write to the physical device absolute
disk.queueWriteLatency.average Average amount of time spent in the VMkernel queue, per SCSI write command, during the collection interval absolute
disk.commands.summation Number of SCSI commands issued during the collection interval delta
disk.busResets.summation Number of SCSI-bus reset commands issued during the collection interval delta
disk.numberRead.summation Number of times data was read from each LUN on the host. delta
disk.deviceLatency.average Average amount of time, in milliseconds, to complete a SCSI command from the physical device absolute
disk.deviceReadLatency.average Average amount of time, in milliseconds, to read from the physical device absolute
disk.usage.average Aggregated disk I/O rate. For hosts, this metric includes the rates for all virtual machines running on the host during the collection interval. rate
disk.numberWriteAveraged.average Average number of write commands issued per second to the datastore during the collection interval. rate
disk.usage.none Aggregated disk I/O rate. For hosts, this metric includes the rates for all virtual machines running on the host during the collection interval. rate
disk.totalWriteLatency.average Average amount of time taken during the collection interval to process a SCSI write command issued by the guest OS to the virtual machine absolute
disk.queueLatency.average Average amount of time spent in the VMkernel queue, per SCSI command, during the collection interval. absolute
disk.kernelLatency.average Average amount of time, in milliseconds, spent by VMkernel to process each SCSI command absolute
disk.write.average Rate at which data is written to each LUN on the host.write rate = # blocksRead per second x blockSize rate
disk.numberWrite.summation Number of times data was written to each LUN on the host. delta
disk.commandsAveraged.average Average number of SCSI commands issued per second during the collection interval rate
disk.totalLatency.average Average amount of time taken during the collection interval to process a SCSI command issued by the guest OS to the virtual machine. absolute
disk.totalReadLatency.average Average amount of time taken during the collection interval to process a SCSI read command issued from the guest OS to the virtual machine absolute
disk.kernelWriteLatency.average Average amount of time, in milliseconds, spent by VMkernel to process each SCSI write command absolute
disk.maxTotalLatency.latest Highest latency value across all disks used by the host. Latency measures the time taken to process a SCSI command issued by the guest OS to the virtual machine. The kernel latency is the time VMkernel takes to process an IO request. The device latency is the time it takes the hardware to handle the request. absolute
disk.numberReadAveraged.average Number of times data was read from each LUN on the host. delta
disk.read.average Rate at which data is read from each LUN on the host.read rate = # blocksRead per second x blockSize rate
disk.queueReadLatency.average Average amount of time spent in the VMkernel queue, per SCSI read command, during the collection interval absolute
disk.usage.minimum Aggregated disk I/O rate. For hosts, this metric includes the rates for all virtual machines running on the host during the collection interval. rate
disk.commandsAborted.summation Number of SCSI commands aborted during the collection interval delta
disk.maxQueueDepth.average Maximum queue depth. absolute
disk.kernelReadLatency.average Average amount of time, in milliseconds, spent by VMkernel to process each SCSI read command absolute
  • cpu类监控项
监控项名称 描述 类型
cpu.reservedCapacity.average Total CPU capacity reserved by virtual machines absolute
cpu.usagemhz.average Sum of the actively used CPU of all powered on virtual machines on a host. The maximum possible value is the frequency of the processors multiplied by the number of processors. For example, if you have a host with four 2GHz CPUs running a virtual machine that is using 4000MHz, the host is using two CPUs completely.4000 / (4 x 2000) = 0.50 rate
cpu.usage.none Actively used CPU of the host, as a percentage of the total available CPU. Active CPU is approximately equal to the ratio of the used CPU to the available CPU. available CPU = # of physical CPUs x clock rate.100% represents all CPUs on the host. For example, if a four-CPU host is running a virtual machine with two CPUs, and the usage is 50%, the host is using two CPUs completely. rate
cpu.coreUtilization.maximum CPU utilization of the corresponding core (if hyper-threading is enabled) as a percentage during the interval (A core is utilized if either or both of its logical CPUs are utilized). rate
cpu.costop.summation Time the virtual machine is ready to run, but is unable to run due to co-scheduling constraints delta
cpu.totalCapacity.average Total CPU capacity reserved by and available for virtual machines absolute
cpu.latency.average Percent of time the virtual machine is unable to run because it is contending for access to the physical CPU(s) rate
cpu.usage.average Actively used CPU of the host, as a percentage of the total available CPU. Active CPU is approximately equal to the ratio of the used CPU to the available CPU. available CPU = # of physical CPUs x clock rate.100% represents all CPUs on the host. For example, if a four-CPU host is running a virtual machine with two CPUs, and the usage is 50%, the host is using two CPUs completely. rate
cpu.utilization.maximum CPU utilization as a percentage during the interval (CPU usage and CPU utilization might be different due to power management technologies or hyper-threading) rate
cpu.coreUtilization.minimum CPU utilization of the corresponding core (if hyper-threading is enabled) as a percentage during the interval (A core is utilized if either or both of its logical CPUs are utilized). rate
cpu.wait.summation Total CPU time spent in wait state.The wait total includes time spent the CPU Idle, CPU Swap Wait, and CPU I/O Wait states. rate
cpu.swapwait.summation CPU time spent waiting for swap-in. delta
cpu.ready.summation Time that the virtual machine was ready, but could not get scheduled to run on the physical CPU during last measurement interval. CPU ready time is dependent on the number of virtual machines on the host and their CPU loads. delta
cpu.utilization.average CPU utilization as a percentage during the interval (CPU usage and CPU utilization might be different due to power management technologies or hyper-threading) rate
cpu.used.summation Time accounted to the virtual machine. If a system service runs on behalf of this virtual machine, the time spent by that service (represented by cpu.system) should be charged to this virtual machine. If not, the time spent (represented by cpu.overlap) should not be charged against this virtual machine. delta
cpu.utilization.none CPU utilization as a percentage during the interval (CPU usage and CPU utilization might be different due to power management technologies or hyper-threading) rate
cpu.idle.summation Total time that the CPU spent in an idle state delta
cpu.coreUtilization.none CPU utilization of the corresponding core (if hyper-threading is enabled) as a percentage during the interval (A core is utilized if either or both of its logical CPUs are utilized). rate
cpu.coreUtilization.average CPU utilization of the corresponding core (if hyper-threading is enabled) as a percentage during the interval (A core is utilized if either or both of its logical CPUs are utilized). rate
cpu.readiness.average Percentage of time that the virtual machine was ready, but could not get scheduled to run on the physical CPU. rate
cpu.usage.maximum Actively used CPU of the host, as a percentage of the total available CPU. Active CPU is approximately equal to the ratio of the used CPU to the available CPU. available CPU = # of physical CPUs x clock rate.100% represents all CPUs on the host. For example, if a four-CPU host is running a virtual machine with two CPUs, and the usage is 50%, the host is using two CPUs completely. rate
cpu.usagemhz.none Host - Sum of the actively used CPU of all powered on virtual machines on a host. The maximum possible value is the frequency of the processors multiplied by the number of processors. For example, if you have a host with four 2GHz CPUs running a virtual machine that is using 4000MHz, the host is using two CPUs completely.4000 / (4 x 2000) = 0.50 rate
cpu.usage.minimum Actively used CPU of the host, as a percentage of the total available CPU. Active CPU is approximately equal to the ratio of the used CPU to the available CPU. available CPU = # of physical CPUs x clock rate.100% represents all CPUs on the host. For example, if a four-CPU host is running a virtual machine with two CPUs, and the usage is 50%, the host is using two CPUs completely. rate
cpu.demand.average The amount of CPU resources a virtual machine would use if there were no CPU contention or CPU limit absolute
cpu.usagemhz.maximum Host - Sum of the actively used CPU of all powered on virtual machines on a host. The maximum possible value is the frequency of the processors multiplied by the number of processors. For example, if you have a host with four 2GHz CPUs running a virtual machine that is using 4000MHz, the host is using two CPUs completely.4000 / (4 x 2000) = 0.50 rate
cpu.utilization.minimum CPU utilization as a percentage during the interval (CPU usage and CPU utilization might be different due to power management technologies or hyper-threading) rate
cpu.usagemhz.minimum Host - Sum of the actively used CPU of all powered on virtual machines on a host. The maximum possible value is the frequency of the processors multiplied by the number of processors. For example, if you have a host with four 2GHz CPUs running a virtual machine that is using 4000MHz, the host is using two CPUs completely.4000 / (4 x 2000) = 0.50 rate
  • mem类监控项
监控项名称 描述 类型
mem.llSwapOut.maximum Amount of memory swapped-out to host cache absolute
mem.swapin.maximum Sum of swapin values for all powered-on virtual machines on the host. bsolute
mem.compressed.average Amount of memory reserved by userworlds. absolute
mem.overhead.none Total of all overhead metrics for powered-on virtual machines, plus the overhead of running vSphere services on the host. absolute
mem.heap.maximum VMkernel virtual address space dedicated to VMkernel main heap and related data absolute
mem.unreserved.none Amount of memory that is unreserved. Memory reservation not used by the Service Console, VMkernel, vSphere services and other powered on VMs’ user-specified memory reservations and overhead memory. This statistic is no longer relevant to virtual machine admission control, as reservations are now handled through resource pools. absolute
mem.reservedCapacity.average Total amount of memory reservation used by powered-on virtual machines and vSphere services on the host. absolute
mem.swapoutRate.average Rate at which memory is being swapped from active memory to disk during the current interval. This counter applies to virtual machines and is generally more useful than the swapout counter to determine if the virtual machine is running slow due to swapping, especially when looking at real-time statistics. rate
mem.vmmemctl.none The sum of all vmmemctl values for all powered-on virtual machines, plus vSphere services on the host. If the balloon target value is greater than the balloon value, the VMkernel inflates the balloon, causing more virtual machine memory to be reclaimed. If the balloon target value is less than the balloon value, the VMkernel deflates the balloon, which allows the virtual machine to consume additional memory if needed. absolute
mem.vmfs.pbc.sizeMax.latest Maximum size the VMFS Pointer Block Cache can grow to absolute
mem.consumed.maximum Amount of machine memory used on the host. Consumed memory includes Includes memory used by the Service Console, the VMkernel, vSphere services, plus the total consumed metrics for all running virtual machines. host consumed memory = total host memory - free host memory absolute
mem.swapin.minimum Sum of swapin values for all powered-on virtual machines on the host. absolute
mem.vmmemctl.maximum The sum of all vmmemctl values for all powered-on virtual machines, plus vSphere services on the host. If the balloon target value is greater than the balloon value, the VMkernel inflates the balloon, causing more virtual machine memory to be reclaimed. If the balloon target value is less than the balloon value, the VMkernel deflates the balloon, which allows the virtual machine to consume additional memory if needed. absolute
mem.llSwapUsed.maximum Space used for caching swapped pages in the host cache absolute
mem.sharedcommon.average Amount of machine memory that is shared by all powered-on virtual machines and vSphere services on the host.Subtract this metric from the shared metric to gauge how much machine memory is saved due to sharing: shared - sharedcommon = machine memory (host memory) savings (KB) absolute
mem.active.minimum Sum of all active metrics for all powered-on virtual machines plus vSphere services (such as COS, vpxa) on the host. absolute
mem.sysUsage.none Amount of host physical memory used by VMkernel for core functionality, such as device drivers and other internal uses. Does not include memory used by virtual machines or vSphere services. absolute
mem.swapin.average Sum of swapin values for all powered-on virtual machines on the host. absolute
mem.zero.none Sum of zero metrics for all powered-on virtual machines, plus vSphere services on the host. absolute
mem.llSwapOut.average Amount of memory swapped-out to host cache absolute
mem.granted.minimum Sum of all granted metrics for all powered-on virtual machines, plus machine memory for vSphere services on the host. absolute
mem.granted.none Sum of all granted metrics for all powered-on virtual machines, plus machine memory for vSphere services on the host. absolute
mem.consumed.minimum Amount of machine memory used on the host. Consumed memory includes Includes memory used by the Service Console, the VMkernel, vSphere services, plus the total consumed metrics for all running virtual machines.host consumed memory = total host memory - free host memory absolute
mem.vmmemctl.minimum The sum of all vmmemctl values for all powered-on virtual machines, plus vSphere services on the host. If the balloon target value is greater than the balloon value, the VMkernel inflates the balloon, causing more virtual machine memory to be reclaimed. If the balloon target value is less than the balloon value, the VMkernel deflates the balloon, which allows the virtual machine to consume additional memory if needed. absolute
mem.consumed.average Amount of machine memory used on the host. Consumed memory includes Includes memory used by the Service Console, the VMkernel, vSphere services, plus the total consumed metrics for all running virtual machines.host consumed memory = total host memory - free host memory absolute
mem.active.none Sum of all active metrics for all powered-on virtual machines plus vSphere services (such as COS, vpxa) on the host. absolute
mem.vmfs.pbc.overhead.latest Amount of VMFS heap used by the VMFS PB Cache absolute
mem.swapout.average Sum of swapout metrics from all powered-on virtual machines on the host. absolute
mem.usage.minimum Percentage of available machine memory:consumed ÷ machine-memory-size absolute
mem.consumed.none Amount of machine memory used on the host. Consumed memory includes Includes memory used by the Service Console, the VMkernel, vSphere services, plus the total consumed metrics for all running virtual machines.host consumed memory = total host memory - free host memory absolute
mem.heapfree.none Free address space in the VMkernel main heap.Varies based on number of physical devices and configuration options. There is no direct way for the user to increase or decrease this statistic. For informational purposes only: not useful for performance monitoring. absolute
mem.vmfs.pbc.size.latest Space used for holding VMFS Pointer Blocks in memory absolute
mem.llSwapOut.none Amount of memory swapped-out to host cache absolute
mem.totalCapacity.average Total amount of memory reservation used by and available for powered-on virtual machines and vSphere services on the host absolute
mem.sysUsage.minimum Amount of host physical memory used by VMkernel for core functionality, such as device drivers and other internal uses. Does not include memory used by virtual machines or vSphere services. absolute
mem.granted.average Sum of all granted metrics for all powered-on virtual machines, plus machine memory for vSphere services on the host. absolute
mem.activewrite.average Estimate for the amount of memory actively being written to by the virtual machine. absolute
mem.unreserved.average Amount of memory that is unreserved. Memory reservation not used by the Service Console, VMkernel, vSphere services and other powered on VMs’ user-specified memory reservations and overhead memory. This statistic is no longer relevant to virtual machine admission control, as reservations are now handled through resource pools. absolute
mem.llSwapIn.average Amount of memory swapped-in from host cache absolute
mem.overhead.maximum Total of all overhead metrics for powered-on virtual machines, plus the overhead of running vSphere services on the host. absolute
mem.llSwapOut.minimum Amount of memory swapped-out to host cache absolute
mem.shared.none Sum of all shared metrics for all powered-on virtual machines, plus amount for vSphere services on the host. The host's shared memory may be larger than the amount of machine memory if memory is overcommitted (the aggregate virtual machine configured memory is much greater than machine memory). The value of this statistic reflects how effective transparent page sharing and memory overcommitment are for saving machine memory. absolute
mem.usage.average Percentage of available machine memory:consumed ÷ machine-memory-size absolute
mem.llSwapInRate.average Rate at which memory is being swapped from host cache into active memory rate
mem.sysUsage.average Amount of host physical memory used by VMkernel for core functionality, such as device drivers and other internal uses. Does not include memory used by virtual machines or vSphere services. absolute
mem.lowfreethreshold.average Threshold of free host physical memory below which ESX/ESXi will begin reclaiming memory from virtual machines through ballooning and swapping absolute
mem.zero.average Sum of zero metrics for all powered-on virtual machines, plus vSphere services on the host. absolute
mem.vmfs.pbc.capMissRatio.latest Trailing average of the ratio of capacity misses to compulsory misses for the VMFS PB Cache absolute
mem.state.latest One of four threshold levels representing the percentage of free memory on the host. The counter value determines swapping and ballooning behavior for memory reclamation.0 (high) Free memory >= 6% of machine memory minus Service Console memory;1 (soft) 4%;2 (hard) 2%; 3 (low) 1%; 0 (high) and 1 (soft): Ballooning is favored over swapping.2 (hard) and 3 (low): Swapping is favored over ballooning. absolute
mem.heap.average VMkernel virtual address space dedicated to VMkernel main heap and related data absolute
mem.heapfree.average Free address space in the VMkernel main heap.Varies based on number of physical devices and configuration options. There is no direct way for the user to increase or decrease this statistic. For informational purposes only: not useful for performance monitoring. absolute
mem.granted.maximum Sum of all granted metrics for all powered-on virtual machines, plus machine memory for vSphere services on the host. absolute
mem.llSwapIn.maximum Amount of memory swapped-in from host cache absolute
mem.llSwapUsed.none Space used for caching swapped pages in the host cache absolute
mem.swapout.minimum Sum of swapout metrics from all powered-on virtual machines on the host. absolute
mem.active.average Sum of all active metrics for all powered-on virtual machines plus vSphere services (such as COS, vpxa) on the host. absolute
mem.llSwapIn.minimum Amount of memory swapped-in from host cache absolute
mem.overhead.minimum Total of all overhead metrics for powered-on virtual machines, plus the overhead of running vSphere services on the host. absolute
mem.llSwapUsed.minimum Space used for caching swapped pages in the host cache absolute
mem.swapinRate.average Rate at which memory is swapped from disk into active memory during the interval. This counter applies to virtual machines and is generally more useful than the swapin counter to determine if the virtual machine is running slow due to swapping, especially when looking at real-time statistics. rate
mem.heapfree.maximum Free address space in the VMkernel main heap.Varies based on number of physical devices and configuration options. There is no direct way for the user to increase or decrease this statistic. For informational purposes only: not useful for performance monitoring. absolute
mem.shared.average Sum of all shared metrics for all powered-on virtual machines, plus amount for vSphere services on the host. The host's shared memory may be larger than the amount of machine memory if memory is overcommitted (the aggregate virtual machine configured memory is much greater than machine memory). The value of this statistic reflects how effective transparent page sharing and memory overcommitment are for saving machine memory. absolute
mem.zero.maximum Sum of zero metrics for all powered-on virtual machines, plus vSphere services on the host. absolute
mem.swapout.maximum Sum of swapout metrics from all powered-on virtual machines on the host. absolute
mem.swapin.none Sum of swapin values for all powered-on virtual machines on the host. absolute
mem.heap.minimum VMkernel virtual address space dedicated to VMkernel main heap and related data absolute
mem.decompressionRate.average Rate of memory decompression for the virtual machine. rate
mem.compressionRate.average Rate of memory compression for the virtual machine. rate
mem.shared.minimum Sum of all shared metrics for all powered-on virtual machines, plus amount for vSphere services on the host. The host's shared memory may be larger than the amount of machine memory if memory is overcommitted (the aggregate virtual machine configured memory is much greater than machine memory). The value of this statistic reflects how effective transparent page sharing and memory overcommitment are for saving machine memory. absolute
mem.unreserved.maximum Amount of memory that is unreserved. Memory reservation not used by the Service Console, VMkernel, vSphere services and other powered on VMs’ user-specified memory reservations and overhead memory. This statistic is no longer relevant to virtual machine admission control, as reservations are now handled through resource pools. absolute
mem.zero.minimum Sum of zero metrics for all powered-on virtual machines, plus vSphere services on the host. absolute
mem.llSwapIn.none Amount of memory swapped-in from host cache absolute
mem.swapused.minimum Amount of memory that is used by swap. Sum of memory swapped of all powered on VMs and vSphere services on the host. absolute
mem.sharedcommon.minimum Amount of machine memory that is shared by all powered-on virtual machines and vSphere services on the host.Subtract this metric from the shared metric to gauge how much machine memory is saved due to sharing: shared - sharedcommon = machine memory (host memory) savings (KB) absolute
mem.active.maximum Sum of all active metrics for all powered-on virtual machines plus vSphere services (such as COS, vpxa) on the host. absolute
mem.vmfs.pbc.workingSet.latest Amount of file blocks whose addresses are cached in the VMFS PB Cache absolute
mem.latency.average Percentage of time the virtual machine is waiting to access swapped or compressed memory absolute
mem.unreserved.minimum Amount of memory that is unreserved. Memory reservation not used by the Service Console, VMkernel, vSphere services and other powered on VMs’ user-specified memory reservations and overhead memory. This statistic is no longer relevant to virtual machine admission control, as reservations are now handled through resource pools. absolute
mem.vmfs.pbc.workingSetMax.latest Maximum amount of file blocks whose addresses are cached in the VMFS PB Cache absolute
mem.llSwapUsed.average Space used for caching swapped pages in the host cache absolute
mem.sharedcommon.maximum Amount of machine memory that is shared by all powered-on virtual machines and vSphere services on the host.Subtract this metric from the shared metric to gauge how much machine memory is saved due to sharing:shared - sharedcommon = machine memory (host memory) savings (KB) absolute
mem.swapout.none Sum of swapout metrics from all powered-on virtual machines on the host. absolute
mem.swapused.maximum Amount of memory that is used by swap. Sum of memory swapped of all powered on VMs and vSphere services on the host. absolute
mem.usage.none Percentage of available machine memory:consumed ÷ machine-memory-size absolute
mem.heapfree.minimum Free address space in the VMkernel main heap.Varies based on number of physical devices and configuration options. There is no direct way for the user to increase or decrease this statistic. For informational purposes only: not useful for performance monitoring. absolute
mem.heap.none VMkernel virtual address space dedicated to VMkernel main heap and related data absolute
mem.vmmemctl.average The sum of all vmmemctl values for all powered-on virtual machines, plus vSphere services on the host. If the balloon target value is greater than the balloon value, the VMkernel inflates the balloon, causing more virtual machine memory to be reclaimed. If the balloon target value is less than the balloon value, the VMkernel deflates the balloon, which allows the virtual machine to consume additional memory if needed. absolute
mem.sysUsage.maximum Amount of host physical memory used by VMkernel for core functionality, such as device drivers and other internal uses. Does not include memory used by virtual machines or vSphere services. absolute
mem.llSwapOutRate.average Rate at which memory is being swapped from active memory to host cache rate
mem.swapused.none Amount of memory that is used by swap. Sum of memory swapped of all powered on VMs and vSphere services on the host. absolute
mem.overhead.average Total of all overhead metrics for powered-on virtual machines, plus the overhead of running vSphere services on the host. absolute
mem.swapused.average Amount of memory that is used by swap. Sum of memory swapped of all powered on VMs and vSphere services on the host. absolute
mem.shared.maximum Sum of all shared metrics for all powered-on virtual machines, plus amount for vSphere services on the host. The host's shared memory may be larger than the amount of machine memory if memory is overcommitted (the aggregate virtual machine configured memory is much greater than machine memory). The value of this statistic reflects how effective transparent page sharing and memory overcommitment are for saving machine memory. absolute
mem.sharedcommon.none Amount of machine memory that is shared by all powered-on virtual machines and vSphere services on the host.Subtract this metric from the shared metric to gauge how much machine memory is saved due to sharing: shared - sharedcommon = machine memory (host memory) savings (KB) absolute
mem.usage.maximum Percentage of available machine memory:consumed ÷ machine-memory-size absolute
  • datastore类监控项
监控项名称 描述 类型
datastore.datastoreWriteBytes.latest Storage DRS datastore bytes written absolute
datastore.datastoreIops.average Average amount of time for an I/O operation to the datastore or LUN across all ESX hosts accessing it. absolute
datastore.siocActiveTimePercentage.average Percentage of time Storage I/O Control actively controlled datastore latency absolute
datastore.numberWriteAveraged.average Average number of write commands issued per second to the datastore during the collection interval rate
datastore.totalWriteLatency.average Average amount of time for a write operation to the datastore. Total latency = kernel latency + device latency. absolute
datastore.datastoreNormalWriteLatency.latest Storage DRS datastore normalized write latency absolute
datastore.sizeNormalizedDatastoreLatency.average Storage I/O Control size-normalized I/O latency absolute
datastore.datastoreVMObservedLatency.latest The average datastore latency as seen by virtual machines absolute
datastore.datastoreReadLoadMetric.latest Storage DRS datastore metric for read workload model absolute
datastore.datastoreMaxQueueDepth.latest Storage I/O Control datastore maximum queue depth absolute
datastore.maxTotalLatency.latest Highest latency value across all datastores used by the host absolute
datastore.datastoreReadIops.latest Storage DRS datastore read I/O rate absolute
datastore.datastoreReadBytes.latest Storage DRS datastore bytes read absolute
datastore.datastoreWriteIops.latest Storage DRS datastore write I/O rate absolute
datastore.datastoreWriteLoadMetric.latest Storage DRS datastore metric for write workload model absolute
datastore.datastoreWriteOIO.latest Storage DRS datastore outstanding write requests absolute
datastore.datastoreReadOIO.latest Storage DRS datastore outstanding read requests absolute
datastore.numberReadAveraged.average Average number of read commands issued per second to the datastore during the collection interval rate
datastore.totalReadLatency.average Average amount of time for a read operation from the datastore. Total latency = kernel latency + device latency. absolute
datastore.read.average Rate of reading data from the datastore rate
datastore.datastoreNormalReadLatency.latest Storage DRS datastore normalized read latency absolute
datastore.write.average Rate of writing data to the datastore absolute

About

适用于Open-Falcon的VSphere监控agent

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Go 94.1%
  • Shell 5.9%