Monitoring Best Practices

Overview

This section provides tips for System Admins (see People) who set up and proactively monitor the CloudCenter platform to ensure successful and continuous performance.

CloudCenter Component Versions

The CCO, CCM, and other CloudCenter Architecture components must all run the same CloudCenter version. They cannot run different CloudCenter versions.

VM Monitoring

At the system level, you can monitor your VMs to harvest memory usage, disk usage, and CPU usage. You can use any third-party monitoring tool (for example, Sensu open source monitoring framework) to collect system-level data from clients, to monitor VMs, and to provide statistics on usage. You can choose to use this monitoring framework or any other framework that works for your enterprise.

Based on the statistics harvested while monitoring the CloudCenter platform, admins can get a fair idea of how the platform performs.

System Performance    Recommended Monitoring Thresholds
Memory                The free memory is less than 10% on a regular basis
CPU                   The CPU usage is between 50% and 100% on a consistent basis
Disk                  The disk is more than 90% full on a regular basis
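
The thresholds above can be expressed as a small check over a series of samples. The following is a minimal sketch, not part of CloudCenter: the Sample type, the function names, and the min_fraction parameter (how many samples must breach a threshold before it counts as "on a regular basis") are all assumptions made for illustration. The samples themselves would come from whichever monitoring tool you use.

```python
from dataclasses import dataclass


@dataclass
class Sample:
    """One measurement from a VM (hypothetical structure)."""
    free_memory_pct: float  # percentage of memory that is free
    cpu_pct: float          # percentage of CPU in use
    disk_used_pct: float    # percentage of disk that is full


def breached(sample):
    """Return the metrics in one sample that cross the recommended thresholds."""
    alerts = []
    if sample.free_memory_pct < 10:
        alerts.append("memory")
    if sample.cpu_pct > 50:
        alerts.append("cpu")
    if sample.disk_used_pct > 90:
        alerts.append("disk")
    return alerts


def sustained_breaches(samples, min_fraction=0.75):
    """Return metrics breached in at least min_fraction of the samples.

    min_fraction is an assumed stand-in for "on a regular basis";
    tune it to your own alerting policy.
    """
    if not samples:
        return []
    counts = {"memory": 0, "cpu": 0, "disk": 0}
    for sample in samples:
        for metric in breached(sample):
            counts[metric] += 1
    needed = min_fraction * len(samples)
    return [metric for metric, count in counts.items() if count >= needed]
```

Requiring a sustained breach rather than alerting on a single sample avoids noise from short spikes, which matches the "regular basis" wording of the thresholds.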

Disk Space Monitoring

Sometimes, the CCM VM may run low on disk space because multiple applications (database applications, OS applications, and so forth) dump data to it. To address these kinds of disk space issues, you can take the following protective measures:

  • Reclaim the disk space from the root partition.
  • Add an additional volume to the CCM and configure your database application (or other applications that dump data) to write data to that volume.
  • Set up Nagios monitoring to send automated alerts when certain thresholds are reached.
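
A disk space check of the kind described above could look like the following minimal sketch. It uses only the Python standard library and returns the conventional Nagios plugin status codes (0 OK, 1 WARNING, 2 CRITICAL). The function names, the default path, and the 80% warning level are assumptions for illustration; the 90% critical level follows the disk threshold recommended earlier in this section.

```python
import shutil


def disk_used_percent(path="/"):
    """Return how full the filesystem containing path is, as a percentage."""
    usage = shutil.disk_usage(path)
    return 100.0 * usage.used / usage.total


def nagios_status(used_pct, warn_pct=80.0, crit_pct=90.0):
    """Map a usage percentage to a Nagios-style (exit_code, message) pair.

    warn_pct is an assumed value; crit_pct mirrors the 90% disk threshold.
    """
    if used_pct > crit_pct:
        return 2, f"CRITICAL - disk {used_pct:.1f}% full"
    if used_pct > warn_pct:
        return 1, f"WARNING - disk {used_pct:.1f}% full"
    return 0, f"OK - disk {used_pct:.1f}% full"
```

To use this as a check script, you could print the message and exit with the returned code, for example by passing the result of nagios_status(disk_used_percent("/")) to print and sys.exit.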

Examples of Key Services that Must be Monitored

  • CCM
    • Tomcat 8
    • PostgreSQL
    • MongoDB
  • CCO
    • Tomcat 8 – on each instance
    • MongoDB
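
One simple way to watch the services listed above is to confirm that each one accepts TCP connections. The sketch below is an illustration under stated assumptions, not a CloudCenter tool: the host names are placeholders, and the ports are the common defaults for Tomcat (8080), PostgreSQL (5432), and MongoDB (27017), which may differ in your deployment.

```python
import socket

# Hypothetical hosts and assumed default ports; adjust to your environment.
SERVICES = {
    "ccm-tomcat": ("ccm.example.com", 8080),
    "ccm-postgresql": ("ccm.example.com", 5432),
    "ccm-mongodb": ("ccm.example.com", 27017),
    "cco-tomcat": ("cco.example.com", 8080),
    "cco-mongodb": ("cco.example.com", 27017),
}


def port_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


def check_services(services):
    """Return a dict mapping each service name to True (reachable) or False."""
    return {name: port_open(host, port) for name, (host, port) in services.items()}
```

Calling check_services(SERVICES) returns one entry per service. An open port only shows that the process is listening, so a production check would usually add an application-level probe as well.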

Additional System Monitoring Pointers

Once you install CloudCenter successfully, ensure the security of your CloudCenter platform and data. In case the CloudCenter platform becomes unavailable due to any local safety-related issue, be sure to have a plan in place to monitor, back up, and restore your CloudCenter platform and data.

Every large enterprise uses one or more system monitoring tools to ensure that business continuity practices are in place. It is your enterprise's responsibility to perform the following tasks for your instance of the CloudCenter platform:

  • Back up your instance of the CloudCenter database on a regular basis.
  • Monitor your instance of CloudCenter at relevant points crucial to your enterprise.

 
