Update 2010-07-21: If you are interested in graphing operations and latency for NFS, CIFS, iSCSI, FCP and/or SAN protocols on a per volume basis, you may want to see this post.
Update 2009-07-21: As requested by Dave, I have posted an FCP template and script. I have no way of testing them (no FCP…), but I think they should work.
I’m one of those people who has to know everything that is going on inside my infrastructure at all times. ESX, vCenter, MySQL, SQL Server, NetApp…I keep close tabs on all of them using both Cacti and Nagios.
Some people might find it strange that I have both running, but the two applications have very different functions. Cacti is superb at trend analysis and detecting abnormalities after the fact (only occasionally during the event). I use it’s data to determine, for example, when is a good time for Exchange to take an outage based on the number of users connected and the number of RPC requests occurring. Nagios, on the other hand, is extremely well suited to real time monitoring and alerting. It checks different data points at intervals and if it finds one out of the accepted range, it tells me.
Anyway, back to the point. I created these graphs to give me detailed information about CIFS, NFS and iSCSI on my NetApp filers. I have used these against FAS270s, FAS2020s and FAS6030s running OnTAP 220.127.116.11, 18.104.22.168 and/or 7.3.1 with success against them all.
In addition to the below templates I use some additional graphs to track other metrics. Those graphs are available here:
As I get time (which is rare) I plan on adding additional graphs, when I do, I will post them here. I would like to get and graph ASIS information, WAFL stats, and space information (raw, formatted, usable, allocated, overhead for the filer as a whole). If anyone has, or knows where to find, these graphs, please let me know!