Agent Multihoming Issue

Saiyad Rahim 391 Reputation points
2024-06-14T11:26:23.5566667+00:00

We are in the process of replicating our Groups and Monitors etc from SCOM2012R2 to SCOM2019 for an environment behind a Gateway. (I know we are way behind and plan to go to SCOM22 soon as this migration is complete).

Have been using Kevin Holman;s MP to Multihome 2012 and 2019: https://kevinholman.com/2017/05/09/scom-management-mp-making-a-scom-admins-life-a-little-easier/

So far we have successfully multihomed a lot of servers behind this Gateway to report via new Gateway and have replicated monitors and started email notifications and are now waiting on removing old 2012R2 Agents and updating the 2019 Agents - Question: What is the best way to update the agents , is it to do a UR update or just start with SCOM 2022 update and this will update the agents...or will this not work and have issues?

Multihoming issue:

Have found a bunch of servers that have not reported in SCOM2019. Have double checked SCOM2012R2 and can confirm that both Mgmt Groups are shown on the SCOM Management>SCOM Agent console view:

multihome

Have confirmed that both Mgmt Groups are present in Servers Control Panel>Microsoft Monitoring Agent window. Have Flushed the Agent and can see in Event Log that HealthService is validating RunAs accounts from SCOM 2019:

event log

Have checked old SCOMs Gateway logs but there is no new entry in there since 2020 (most probably when the 2012R2 agent was first installed.)

Could not find any files for this server in SCOM2019 Gateway. Server is not popping up in Pending Mgmt either.

There are a lot of servers that are in this same situation, is there something in the database that can be looked at ?

Has anyone else come across this issue as well?

Operations Manager
Operations Manager
A family of System Center products that provide infrastructure monitoring, help ensure the predictable performance and availability of vital applications, and offer comprehensive monitoring for datacenters and cloud, both private and public.
1,440 questions
Microsoft System Center
Microsoft System Center
A suite of Microsoft systems management products that offer solutions for managing datacenter resources, private clouds, and client devices.
891 questions
0 comments No comments
{count} votes

3 answers

Sort by: Most helpful
  1. Saiyad Rahim 391 Reputation points
    2024-06-17T23:55:25.4066667+00:00

    !!! UPDATE!!!

    Have been digging into this a little deeper and this is what i have come up with so far. Hope someone has got any way forward.

    From a fleet of approx 300 servers only 12 servers belonging to a specific production application stack are not multihoming. If other servers in the same network are multihoming than this is not a firewall or a port issue. Started comparing server configuration between working servers and faulting servers.

    What I found tells me it is a DNS issue. The working Servers are configured with the environments standard Preferred DNS and Alternate DNS IPs

    Whereas the faulting Servers are configured with only one Prefered IP and this IP is not from the standard Prefered or Alternate IPs in this environment.

    There was no DNS entry for this IP but was pingable and resolved to a DNS name...more confusing.

    Had to use this single IP and RDP in to the server.

    Found out that this Server is not connected to the Domain and is configured as a Stand Alone DNS Server for this environment.

    Asked the team and only one person responded that he too found this server while troubleshooting another issue in the environment and says it was setup by a contractor who was brought in to do some security and redundancy piece of work for this Application Stack.

    But again - i haven't seen any documentation.

    Anyway - the DNS records doesn't seem to have been updated for more that 4 years. Do i update them or not -- asked the Application Support SME and he too wasn't sure about this setup as he has come in after the fact. Asked if I could instead go and update the faulting servers to our standard Prefered and Alternate DNS IPs --- again no one wants to make that decision as they dont know what will be the impact or why this setup was in place.

    Checked the Conditional Forwarder for our Domain (which has the Primary SCOM Mgmt server) and that was out of date as well. So have updated the Conditional Forwarder with latest DNS records.

    Restarted Agents on the faulting server and waited hoping that they might come up in the Pending Mgmt view in SCOM.

    It hasn't.

    Looking at the OpsMgr Event Log on the Agent/Faulting Server still shows:

    Event ID:21006 The OpsMgr Connector could not connect to GatewayServer.contoso.com:5723. The error code is 11001L(No such host is known.). Please verify there is network connectivity, the server is running and has registered it's listening port, and there are no firewalls blocking traffic to the destination.

    Again - this doesn't make sense to me as both the Gateway and the faulting servers are in the same network/domain.

    Anyone with any ideas on what i can check in this scenario?

    0 comments No comments

  2. XinGuo-MSFT 15,626 Reputation points
    2024-06-18T02:18:17.9266667+00:00

    Hi,

    Event ID:21006 The OpsMgr Connector could not connect to GatewayServer.contoso.com:5723. The error code is 11001L(No such host is known.). Please verify there is network connectivity, the server is running and has registered it's listening port, and there are no firewalls blocking traffic to the destination.

    The Test-NetConnection cmdlet is a powerful command used in PowerShell to test the network connection on a system. It can be used to diagnose network connectivity issues and to provide information about the network connection status on the host that runs the command.

    Here’s a basic example of how to use it:

    Test-NetConnection -ComputerName GatewayServer.contoso.com
    

    You can also specify a port with the -Port parameter. For example:

    Test-NetConnection -ComputerName GatewayServer.contoso.com -Port 5723
    

  3. Saiyad Rahim 391 Reputation points
    2024-06-23T09:02:02.9533333+00:00

    I have resolved this issue by adding the Gateway Server DNS to this stand alone DNS server and the Servers have popped up in Pending Management. HAve approved Manual Agent Install. will go in SQL and change their IsManuallIstall to 0.

    0 comments No comments