  1. Primary/failover collectors are pretty standard and help eliminate gaps during regular downtime such as patching. We've also started rolling out a few Load Balanced Collector Groups for our larger deployments, which makes this even easier.
  2. You could probably do it via the API, but that doesn't help you inside the portal. We're looking into doing this with ServiceNow and attaching the ServiceNow location to all possible UI areas (Resources, Websites, Collectors, Services).
  3. Can you elaborate on your issues with event sources? I quickly cloned the default Windows event source to look only for Event ID 1074 in the System log, and this is what I triggered an alert on: