使用 az networkcloud baremetalmachine run-data-extract
命令針對裸機計算機問題進行疑難解答
在某些情況下,使用者可能需要調查並解決內部部署裸機計算機的問題。 Azure 操作員 Nexus 會透過 az networkcloud baremetalmachine run-data-extract
提供一組指定的數據擷取命令。 這些命令可讓使用者從裸機計算機取得診斷數據。
此命令會產生輸出檔案,其中包含位於叢集管理員 Azure 儲存體 帳戶中數據擷取的結果。
開始之前
- 本文假設您已安裝 Azure 命令行介面和
networkcloud
命令行介面延伸模組。 如需詳細資訊,請參閱 如何安裝 CLI 延伸模組。 - 目標裸機機器已開啟,且 readyState 已設定為 True。
- 這些命令的語法是以 CLI 的
az networkcloud
0.3.0+ 版本為基礎。 - 取得您為叢集資源建立的叢集受控資源組名 (cluster_MRG)。
執行執行命令
執行數據擷取命令會執行一或多個預先定義的腳本,以從裸機計算機擷取數據。
目前支援的命令清單為
Dell 疑難解答的 SupportAssist/TSR 集合
命令名稱:hardware-support-data-collection
自變數:要求的記錄類型SysInfo
- 系統資訊TTYLog
- 儲存體 TTYLog 數據Debug
- 偵錯記錄
收集 適用於端點的 Microsoft Defender (MDE) 代理程式資訊
命令名稱:mde-agent-information
自變數:無收集 MDE 診斷支持記錄
命令名稱:mde-support-diagnostics
自變數:無收集 Dell 硬體匯總狀態
命令名稱:hardware-rollup-status
自變數:無
命令語法為:
az networkcloud baremetalmachine run-data-extract --name "<machine-name>" \
--resource-group "<cluster_MRG>" \
--subscription "<subscription>" \
--commands '[{"arguments":["<arg1>","<arg2>"],"command":"<command1>"}]' \
--limit-time-seconds "<timeout>"
在選項中使用 --commands
json 格式指定多個命令。 每個 command
都會指定命令和自變數。 針對具有多個自變數的命令,提供 做為參數的清單 arguments
。 如需建構--commands
結構的指示,請參閱 Azure CLI 速記。
這些命令可以長時間執行,因此建議設定為 --limit-time-seconds
至少 600 秒(10 分鐘)。 選項 Debug
或執行多個擷取可能需要超過 10 分鐘的時間。
在回應中,作業會以異步方式執行,並傳回 HTTP 狀態代碼 202。 如需如何追蹤命令完成和檢視輸出檔案的詳細資訊,請參閱檢視輸出一節。
硬體支持數據收集
此範例會執行 命令, hardware-support-data-collection
並從 Dell Server 取得 SysInfo
和 TTYLog
記錄。 腳本會在 racadm supportassist collect
指定的裸機電腦上執行命令。 產生的tar.gz檔案包含 中的 hardware-support-data-<timestamp>.zip
壓縮解壓縮命令檔輸出。
az networkcloud baremetalmachine run-data-extract --name "bareMetalMachineName" \
--resource-group "cluster_MRG" \
--subscription "subscription" \
--commands '[{"arguments":["SysInfo", "TTYLog"],"command":"hardware-support-data-collection"}]' \
--limit-time-seconds 600
hardware-support-data-collection
輸出
====Action Command Output====
Executing hardware-support-data-collection command
Getting following hardware support logs: SysInfo,TTYLog
Job JID_814372800396 is running, waiting for it to complete ...
Job JID_814372800396 Completed.
---------------------------- JOB -------------------------
[Job ID=JID_814372800396]
Job Name=SupportAssist Collection
Status=Completed
Scheduled Start Time=[Not Applicable]
Expiration Time=[Not Applicable]
Actual Start Time=[Thu, 13 Apr 2023 20:54:40]
Actual Completion Time=[Thu, 13 Apr 2023 20:59:51]
Message=[SRV088: The SupportAssist Collection Operation is completed successfully.]
Percent Complete=[100]
----------------------------------------------------------
Deleting Job JID_814372800396
Collection successfully exported to /hostfs/tmp/runcommand/hardware-support-data-2023-04-13T21:00:01.zip
================================
Script execution result can be found in storage account:
https://cm2p9bctvhxnst.blob.core.windows.net/bmm-run-command-output/dd84df50-7b02-4d10-a2be-46782cbf4eef-action-bmmdataextcmd.tar.gz?se=2023-04-14T01%3A00%3A15Zandsig=ZJcsNoBzvOkUNL0IQ3XGtbJSaZxYqmtd%2BM6rmxDFqXE%3Dandsp=randspr=httpsandsr=bandst=2023-04-13T21%3A00%3A15Zandsv=2019-12-12
收集的硬體支援檔案範例清單
Archive: TSR20240227164024_FM56PK3.pl.zip
creating: tsr/hardware/
creating: tsr/hardware/spd/
creating: tsr/hardware/sysinfo/
creating: tsr/hardware/sysinfo/inventory/
inflating: tsr/hardware/sysinfo/inventory/sysinfo_CIM_BIOSAttribute.xml
inflating: tsr/hardware/sysinfo/inventory/sysinfo_CIM_Sensor.xml
inflating: tsr/hardware/sysinfo/inventory/sysinfo_DCIM_View.xml
inflating: tsr/hardware/sysinfo/inventory/sysinfo_DCIM_SoftwareIdentity.xml
inflating: tsr/hardware/sysinfo/inventory/sysinfo_CIM_Capabilities.xml
inflating: tsr/hardware/sysinfo/inventory/sysinfo_CIM_StatisticalData.xml
creating: tsr/hardware/sysinfo/lcfiles/
inflating: tsr/hardware/sysinfo/lcfiles/lclog_0.xml.gz
inflating: tsr/hardware/sysinfo/lcfiles/curr_lclog.xml
creating: tsr/hardware/psu/
creating: tsr/hardware/idracstateinfo/
inflating: tsr/hardware/idracstateinfo/avc.log
extracting: tsr/hardware/idracstateinfo/avc.log.persistent.1
[..snip..]
收集 MDE 代理程式資訊
資料會使用 mde-agent-information
命令收集,並將 格式設定為 JSON 至 /hostfs/tmp/runcommand/mde-agent-information.json
。 JSON 檔案位於記憶體帳戶中的數據擷取 zip 檔案中。 腳本會在指定的裸機電腦上執行一連串 mdatp
的命令。
此範例會 mde-agent-information
執行不含自變數的命令。
az networkcloud baremetalmachine run-data-extract --name "bareMetalMachineName" \
--resource-group "cluster_MRG" \
--subscription "subscription" \
--commands '[{"command":"mde-agent-information"}]' \
--limit-time-seconds 600
mde-agent-information
輸出
====Action Command Output====
Executing mde-agent-information command
MDE agent is running, proceeding with data extract
Getting MDE agent information for bareMetalMachine
Writing to /hostfs/tmp/runcommand
================================
Script execution result can be found in storage account:
https://cmzhnh6bdsfsdwpbst.blob.core.windows.net/bmm-run-command-output/f5962f18-2228-450b-8cf7-cb8344fdss63b0-action-bmmdataextcmd.tar.gz?se=2023-07-26T19%3A07%3A22Z&sig=X9K3VoNWRFP78OKqFjvYoxubp65BbNTq%2BGnlHclI9Og%3D&sp=r&spr=https&sr=b&st=2023-07-26T15%3A07%3A22Z&sv=2019-12-12
收集的 JSON 物件範例
{
"diagnosticInformation": {
"realTimeProtectionStats": $real_time_protection_stats,
"eventProviderStats": $event_provider_stats
},
"mdeDefinitions": $mde_definitions,
"generalHealth": $general_health,
"mdeConfiguration": $mde_config,
"scanList": $scan_list,
"threatInformation": {
"list": $threat_info_list,
"quarantineList": $threat_info_quarantine_list
}
}
收集 MDE 支持診斷
從 mde-support-diagnostics
命令收集的數據會使用 MDE Client Analyzer 工具,將命令和相關記錄檔的信息 mdatp
組合在一起。 記憶體帳戶 tgz
檔案將包含 zip
名為的 mde-support-diagnostics-<hostname>.zip
檔案。 zip
應連同任何支援要求一起傳送 ,以確保支援小組在需要時可以使用記錄來進行疑難解答和根本原因分析。
此範例會 mde-support-diagnostics
執行不含自變數的命令。
az networkcloud baremetalmachine run-data-extract --name "bareMetalMachineName" \
--resource-group "cluster_MRG" \
--subscription "subscription" \
--commands '[{"command":"mde-support-diagnostics"}]' \
--limit-time-seconds 600
mde-support-diagnostics
輸出
====Action Command Output====
Executing mde-support-diagnostics command
[2024-01-23 16:07:37.588][INFO] XMDEClientAnalyzer Version: 1.3.2
[2024-01-23 16:07:38.367][INFO] Top Command output: [/tmp/top_output_2024_01_23_16_07_37mel0nue0.txt]
[2024-01-23 16:07:38.367][INFO] Top Command Summary: [/tmp/top_summary_2024_01_23_16_07_370zh7dkqn.txt]
[2024-01-23 16:07:38.367][INFO] Top Command Outliers: [/tmp/top_outlier_2024_01_23_16_07_37aypcfidh.txt]
[2024-01-23 16:07:38.368][INFO] [MDE Diagnostic]
[2024-01-23 16:07:38.368][INFO] Collecting MDE Diagnostic
[2024-01-23 16:07:38.613][WARNING] mde is not running
[2024-01-23 16:07:41.343][INFO] [SLEEP] [3sec] waiting for agent to create diagnostic package
[2024-01-23 16:07:44.347][INFO] diagnostic package path: /var/opt/microsoft/mdatp/wdavdiag/5b1edef9-3b2a-45c1-a45d-9e7e4b6b869e.zip
[2024-01-23 16:07:44.347][INFO] Successfully created MDE diagnostic zip
[2024-01-23 16:07:44.348][INFO] Adding mde_diagnostic.zip to report directory
[2024-01-23 16:07:44.348][INFO] Collecting MDE Health
[...snip...]
================================
Script execution result can be found in storage account:
https://cmmj627vvrzkst.blob.core.windows.net/bmm-run-command-output/7c5557b9-b6b6-a4a4-97ea-752c38918ded-action-bmmdataextcmd.tar.gz?se=2024-01-23T20%3A11%3A32Z&sig=9h20XlZO87J7fCr0S1234xcyu%2Fl%2BVuaDh1BE0J6Yfl8%3D&sp=r&spr=https&sr=b&st=2024-01-23T16%3A11%3A32Z&sv=2019-12-12
下載執行結果檔之後,可以解壓縮支援檔案進行分析。
MDE 用戶端分析器所收集資訊的範例清單
Archive: mde-support-diagnostics-rack1compute02.zip
inflating: mde_diagnostic.zip
inflating: process_information.txt
inflating: auditd_info.txt
inflating: auditd_log_analysis.txt
inflating: auditd_logs.zip
inflating: ebpf_kernel_config.txt
inflating: ebpf_enabled_func.txt
inflating: ebpf_syscalls.zip
inflating: ebpf_raw_syscalls.zip
inflating: messagess.zip
inflating: conflicting_processes_information.txt
[...snip...]
硬體匯總狀態
資料會使用 hardware-rollup-status
命令收集,並將 格式設定為 JSON 至 /hostfs/tmp/runcommand/rollupStatus.json
。 JSON 檔案位於記憶體帳戶中的數據擷取 zip 檔案中。 收集的數據會顯示機器子系統的健康情況。
此範例會 hardware-rollup-status
執行不含自變數的命令。
az networkcloud baremetalmachine run-data-extract --name "bareMetalMachineName" \
--resource-group "clusete_MRG" \
--subscription "subscription" \
--commands '[{"command":"hardware-rollup-status"}]' \
--limit-time-seconds 600
hardware-rollup-status
輸出
====Action Command Output====
Executing hardware-rollup-status command
Getting rollup status logs for b37dev03a1c002
Writing to /hostfs/tmp/runcommand
================================
Script execution result can be found in storage account:
https://cmkfjft8twwpst.blob.core.windows.net/bmm-run-command-output/20b217b5-ea38-4394-9db1-21a0d392eff0-action-bmmdataextcmd.tar.gz?se=2023-09-19T18%3A47%3A17Z&sig=ZJcsNoBzvOkUNL0IQ3XGtbJSaZxYqmtd%3D&sp=r&spr=https&sr=b&st=2023-09-19T14%3A47%3A17Z&sv=2019-12-12
收集的範例 JSON
{
"@odata.context" : "/redfish/v1/$metadata#DellRollupStatusCollection.DellRollupStatusCollection",
"@odata.id" : "/redfish/v1/Systems/System.Embedded.1/Oem/Dell/DellRollupStatus",
"@odata.type" : "#DellRollupStatusCollection.DellRollupStatusCollection",
"Description" : "A collection of DellRollupStatus resource",
"Members" :
[
{
"@odata.context" : "/redfish/v1/$metadata#DellRollupStatus.DellRollupStatus",
"@odata.id" : "/redfish/v1/Systems/System.Embedded.1/Oem/Dell/DellRollupStatus/iDRAC.Embedded.1_0x23_SubSystem.1_0x23_Current",
"@odata.type" : "#DellRollupStatus.v1_0_0.DellRollupStatus",
"CollectionName" : "CurrentRollupStatus",
"Description" : "Represents the subcomponent roll-up statuses.",
"Id" : "iDRAC.Embedded.1_0x23_SubSystem.1_0x23_Current",
"InstanceID" : "iDRAC.Embedded.1#SubSystem.1#Current",
"Name" : "DellRollupStatus",
"RollupStatus" : "Ok",
"SubSystem" : "Current"
},
{
"@odata.context" : "/redfish/v1/$metadata#DellRollupStatus.DellRollupStatus",
"@odata.id" : "/redfish/v1/Systems/System.Embedded.1/Oem/Dell/DellRollupStatus/iDRAC.Embedded.1_0x23_SubSystem.1_0x23_Voltage",
"@odata.type" : "#DellRollupStatus.v1_0_0.DellRollupStatus",
"CollectionName" : "VoltageRollupStatus",
"Description" : "Represents the subcomponent roll-up statuses.",
"Id" : "iDRAC.Embedded.1_0x23_SubSystem.1_0x23_Voltage",
"InstanceID" : "iDRAC.Embedded.1#SubSystem.1#Voltage",
"Name" : "DellRollupStatus",
"RollupStatus" : "Ok",
"SubSystem" : "Voltage"
},
[..snip..]
檢視輸出
請注意命令執行tar.gz壓縮檔的連結。 tar.gz檔名會識別叢集管理員資源群組 儲存體 帳戶中的檔案。 您也可以使用連結直接存取輸出 zip 檔案。 tar.gz檔案也包含壓縮的解壓縮命令檔輸出。 藉由在選擇性自變數 --output-directory
中指定目錄路徑,將輸出檔案從記憶體 Blob 下載到本機目錄。