Update 2010-07-21: If you are interested in graphing operations and latency for NFS, CIFS, iSCSI, FCP and/or SAN protocols on a per volume basis, you may want to see this post.
Update 2009-07-21: As requested by Dave, I have posted an FCP template and script. I have no way of testing them (no FCP…), but I think they should work.
I’m one of those people who has to know everything that is going on inside my infrastructure at all times. ESX, vCenter, MySQL, SQL Server, NetApp…I keep close tabs on all of them using both Cacti and Nagios.
Some people might find it strange that I have both running, but the two applications have very different functions. Cacti is superb at trend analysis and detecting abnormalities after the fact (only occasionally during the event). I use it’s data to determine, for example, when is a good time for Exchange to take an outage based on the number of users connected and the number of RPC requests occurring. Nagios, on the other hand, is extremely well suited to real time monitoring and alerting. It checks different data points at intervals and if it finds one out of the accepted range, it tells me.
Anyway, back to the point. I created these graphs to give me detailed information about CIFS, NFS and iSCSI on my NetApp filers. I have used these against FAS270s, FAS2020s and FAS6030s running OnTAP 126.96.36.199, 188.8.131.52 and/or 7.3.1 with success against them all.
In addition to the below templates I use some additional graphs to track other metrics. Those graphs are available here:
- NetApp Filer: graphing Performance Stats and IO’s (template) (be sure to apply the performance tweak found here)
- NetApp Graphs
As I get time (which is rare) I plan on adding additional graphs, when I do, I will post them here. I would like to get and graph ASIS information, WAFL stats, and space information (raw, formatted, usable, allocated, overhead for the filer as a whole). If anyone has, or knows where to find, these graphs, please let me know!
Shows a detailed breakdown of the different types of CIFS operations that occur. This template relies upon SNMP to gather it’s data, so make sure SNMP is enabled on your Filer.
These templates show NFS version 3 operations and latency for the server as a whole. It is possible to get latency information on a per volume basis, but I haven’t gotten my script for getting this information to a place where I’m happy with it yet (and therefore am not releasing it).
These templates rely upon the NetApp Perl SDK. When I wrote the scripts I wrote them so that it looked in the current directory for a directory called “Netapp” that should contain the NaElement.pm and NaServer.pm files. I have included them in the zip, just place the “Netapp” folder in your scripts directory along with the perl scripts.
After importing the template, when you create the graphs for your filer it will ask you for the username and password to connect via the SDK. Please be aware that this information is stored in the Cacti database in plain text, so make sure that you take a “least privileges necessary” approach when setting up the user (in other words, don’t use the root user…).
Just like with the NFS templates, these use the SDK, so please make sure that you place the Netapp folder in your scripts directory.
FibreChannel, NFSv4 and the lack thereof
I don’t have any FC, so I have no need to monitor it. If someone desires, I can create the templates and graphs, but I have no way to verify they work because I don’t have any of that traffic type. Same goes for NFS version 4…I can create, just can’t verify.
2009-07-21: FCP has been added. They can be downloaded here.