1. The syntax given alongside is of a Binary Literal in PySpark SQL. What does the parameter num represent here?
X { 'num [ ... ]' | "num [ ... ]" }
A.Any hexadecimal number from 0 to F (correct) B.Any integer from 0 to 9 C.Any integer from -1 to 9 D.None of these
2. Which of these is the correct way to use a Date Literal in PySpark SQL:
SELECT DATE '1997' AS col;
SELECT DATE '1997-01-20' AS col;
SELECT DATE '1997-01' AS col;
A.1 B.2 C.3 D.All of these (correct)
3. Which of the following can you use to process data using PySpark SQL:
PySQL
SQL
HiveQL
a.1 b.2 c.3 d.2 and 3 (correct)
4. You are training a K-means clustering model in PySpark 2.4.4. Which of the following parameters can be used to control the distance threshold within which a center is considered to have converged?
a.maxIterations b.k c.epsilon (correct) d.seed
5. You are using the following PySpark method to perform an online update of centroids while working with a clustering model. What is the appropriate value of the decay factor to ensure that during the update, the weighted mean of the previous and new data is considered: Method
Post a Comment