Collision on non-unique "headnodehost" hostname across HDInsight clusters
The context: Let's suppose I have multiple HdInsight4.0 clusters. Also suppose that I would like to access the Hadoop services eg jobhistory server running inside these clusters. Let's suppose I get the corresponding jobhistory address from each…
Azure HDInsight HBase - Create non-admin SSH user
I would like to create a standard user account on all HDInsight nodes that is a non-admin account. This will be used to login to the nodes and run some basic commands. There is an admin account created by default on all nodes, but is there a way for…
HWC and Hive (HDinsight): reserved keyword as a column name
with an attempt to save the dataframe for a table which has a column name 'timestamp' and SaveMode.Overwrite, the following exception occurs: org.apache.hadoop.hive.ql.parse.ParseException:line 1:47 cannot recognize input near 'timestamp' 'timestamp'…
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
Setting StatusFolder for HDInsight Spark job triggered through Data Factory
Currently our Spark job runs result in a number of folders with random guid names being created in the root directory of the container we use as our HDInsight cluster storage. This seems to be the folder in the context of which the job runs, it has a…
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
Outbound proxy to HDInsight management
Hello, How can we route traffic to HDInsight management through a proxy to avoid opening NSG outbound and also to not use an Azure firewall? https://video2.skills-academy.com/en-us/azure/hdinsight/hdinsight-restrict-outbound-traffic Thank you, …
Conda installation in HDHinsight taking too long to run and time out after several hours - what could be the reason?
Package installation with anaconda in HDHinsight cluster taking too long to run and time out after several hours - what could be the reason?
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
Looking for HDInsight Script Action sample scripts for installing python package in PySpark 3
Looking for HDInsight Script Action sample scripts for installing python package in PySpark 3
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
AZURE HDInsight VM list
On this page, they mentioned what minimum server required. But didn't mention how many VM required. For example minimum how many head nodes required and how many worker nodes required for HBASE? Please advice
Unable to write dataframe into hive table
Team , We are using Hive Interactive cluster and Spark cluster . We have done the LLAP related configuration on Spark cluster . Now both the cluster are interacting each other without any issues. I tried to load dataset (adl gen2 filesystem) into hive…
Would like to request increase of quota for HDInsight West Europe
Would like to request increase of quota for HDInsight West Europe
Unable to create HDInsight cluster using azure powershell
Unable to Create a HDInsight cluster using the below link https://video2.skills-academy.com/en-in/azure/hdinsight/hdinsight-administer-use-powershell New-AzHDInsightCluster: Line | 10 | -DefaultStorageAccountName…
AZURE HDInsight.
Hi, I am new to AZURE HDInsight. We have planned to create a new project in HDInsight. It's required Hadoop, BI, Analytics, IoT, Migration,Ambari ...etc Minimum how many VM'S required for this project. Please Advice
Deploy an edge node
Hi everyone, it is possible deploy an edge node with specific kernel on an existing HDInsight cluster? Best Regards, Simone
Spark cluster to read Hive on differnt HDI cluster
I have two different HDI clusters say Cluster A , Cluster B . One HDInsight (Cluster A) is spark cluster and another one(Cluster B) is provisioned with hive. I need to run spark processing in Cluster A and need to connect to hive which is in Cluster…
What will be the max throughput of Kafka rest proxy enabled on HDINSIGHT Kafka cluster
I would like to set up a Kafka cluster, which needs an ingestion (producer) throughput of around 150MB/Second. In order to achieve that in my local setup I am needing 4 rest proxy servers of 8 CPUs each. However, when I am trying a create a Kafka cluster…
HDFS FileSystem utility not supported for multiple container/storage account
Hi Team, I am facing a problem on renaming file location(path) from one container to another container using rename function from Hadoop(HDI) Filesystem utility(https://hadoop.apache.org/docs/stable/api/org/apache/hadoop/fs/FileSystem.html). Get to…
nodes are created with anonomous subscription id
Hi there, When I create a hdinsight clusters, the cluster is created with the subscription id that I am in but every time the nodes are forming with different anonymous id and when I try to access the nodes for example zookeeper it is throwing 401…
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
Nodes Details in Azure Hbase
Which node is better to process 1.5 millions records daily where total size of storage is 10tb. Its there any way to auto-scale Hbase cluster as there will be only 1 to 2 transaction a day but data size will be large.
Getting error when using SHC to query HBase data in Spark
Hello, I am trying to read data stored in HBase table from Spark cluster using SHC (reference link: https://video2.skills-academy.com/en-gb/azure/hdinsight/hdinsight-using-spark-query-hbase) I followed all the steps as is, but when running the command…
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
Fetching the Spark Yarn log from Azure HDInsight
Hi Team, Currently through LIVY I am Posting/submitting spark jobs to Azure HDInsight Cluster. After job finishes I am looking into Spark History Server for yarn logs. Livy log for each spark job is not providing yarn logs. Can we Fetch the Spark…
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)