How to Check If a Column Exists in a Spark Dataset in Java
How can we check if a column exists in a Spark Dataset in Java?
In the Java API, df.columns() returns a String[] of the top-level column names, so we can use any standard technique for checking whether an array contains a value.
1. Using Arrays.asList() and contains()
Let’s convert the array into a list and call contains() on it.
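// Requires: import java.util.Arrays;
String columnToCheck = "maybeColumn";
// true if a top-level column with exactly this name exists (case-sensitive match)
boolean exists = Arrays.asList(df.columns()).contains(columnToCheck);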
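If the check is needed in several places, it can be wrapped in a small helper. The sketch below is only an illustration: the class name ColumnCheck and the method hasColumn are hypothetical, not part of Spark. A plain String[] stands in for the result of df.columns() so the example runs without a Spark session.

```java
import java.util.Arrays;

// Hypothetical helper class; the names here are illustrative, not Spark API.
final class ColumnCheck {
    private ColumnCheck() {}

    // columns is expected to be the array returned by df.columns().
    // The match is exact and case-sensitive.
    static boolean hasColumn(String[] columns, String columnToCheck) {
        return Arrays.asList(columns).contains(columnToCheck);
    }

    public static void main(String[] args) {
        // Stand-in for df.columns(); assumed column names for the demo.
        String[] columns = {"id", "name", "createdAt"};
        System.out.println(hasColumn(columns, "name"));        // true
        System.out.println(hasColumn(columns, "maybeColumn")); // false
    }
}
```

With a real Dataset, the call would look like ColumnCheck.hasColumn(df.columns(), "maybeColumn"). Because df.columns() lists only top-level columns, a field nested inside a struct column will not be found this way.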
2. Using Arrays.stream() and anyMatch()
We can also create a stream over the array and run anyMatch() on that stream.
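// Requires: import java.util.Arrays;
String columnToCheck = "maybeColumn";
// true if any column name equals columnToCheck (case-sensitive match)
boolean exists = Arrays.stream(df.columns()).anyMatch(columnToCheck::equals);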
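One thing to watch: both checks above compare names case-sensitively, while Spark resolves column names case-insensitively by default (spark.sql.caseSensitive is false unless changed). Under that default, a column named "Name" can be selected as "name" even though an exact-match check reports it missing. A rough sketch of a case-insensitive variant follows; the class and method names are hypothetical, and equalsIgnoreCase only approximates Spark's own name resolution.

```java
import java.util.Arrays;

// Hypothetical helper class; the names here are illustrative, not Spark API.
final class ColumnCheckIgnoreCase {
    private ColumnCheckIgnoreCase() {}

    // columns is expected to be the array returned by df.columns().
    // Matches names regardless of letter case.
    static boolean hasColumnIgnoreCase(String[] columns, String columnToCheck) {
        return Arrays.stream(columns).anyMatch(columnToCheck::equalsIgnoreCase);
    }

    public static void main(String[] args) {
        // Stand-in for df.columns(); assumed column names for the demo.
        String[] columns = {"id", "Name", "createdAt"};
        System.out.println(hasColumnIgnoreCase(columns, "name"));        // true
        System.out.println(hasColumnIgnoreCase(columns, "maybeColumn")); // false
    }
}
```

If the session runs with spark.sql.caseSensitive set to true, the exact-match versions are the appropriate ones to use.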