Need to Remove Empty Columns from the .csv file

Dinesh Prajapati 126 Reputation points
2022-12-15T12:08:34.613+00:00

I want to remove empty column from the csv file. for eg. at column2, column 5 the column is empty without header or full column is empty, then how can we remove the empty columns. Any idea please

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,164 questions
Active Directory Federation Services
Active Directory Federation Services
An Active Directory technology that provides single-sign-on functionality by securely sharing digital identity and entitlement rights across security and enterprise boundaries.
1,250 questions
Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
2,945 questions
{count} votes

1 answer

Sort by: Most helpful
  1. AnnuKumari-MSFT 32,821 Reputation points Microsoft Employee
    2022-12-16T06:02:00.903+00:00

    Hi @Dinesh Prajapati ,

    Welcome to Microsoft Q&A platform and thanks for posting your question here.

    As I understand your query, you are trying to delete the columns having empty columnnames or columnvalues from the dataset . Please let me know if that is not the ask here.

    You can use select function in pyspark on top of remove(" "). Check below:

    df = sqlContext.createDataFrame([(1,"", "a"," "), (2,"", "b"," "), (5,"", "c"," "), (8,"", "d"," ")], ("id"," ", "name"," "))  
      
    +---+---+---+---+  
    | id|   |name|   |  
    +---+---+---+---+  
    |  1|   |  a|   |  
    |  2|   |  b|   |  
    |  5|   |  c|   |  
    |  8|   |  d|   |  
    +---+---+---+---+  
      
    a=list(set(df.columns))  
    a.remove(" ")  
    df=df.select(a)  
    df.show()  
      
    +---+---+  
    |name| id|  
    +---+---+  
    |  a|  1|  
    |  b|  2|  
    |  c|  5|  
    |  d|  8|  
    +---+---+  
    

    Hope this will help. Please let us know if any further queries.

    ------------------------------

    • Please don't forget to click on 130616-image.png or upvote 130671-image.png button whenever the information provided helps you.
      Original posters help the community find answers faster by identifying the correct answer. Here is how
    • Want a reminder to come back and check responses? Here is how to subscribe to a notification
    • If you are interested in joining the VM program and help shape the future of Q&A: Here is how you can be part of Q&A Volunteer Moderators

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.