How to flatten nested list via pyspark

How to flatten nested list via pyspark

tiger analytics
Input:

my_list = [[1], [2, 3], [4, 5, 6, 7]]

Expected Output:

[1,2,3,4,5,6,7]


Solve:

#1

my_list = [[1], [2, 3], [4, 5, 6, 7]]

flat_list = [num for sublist in my_list for num in sublist]

print(flat_list)


#2

my_list = [[1], [2, 3], [4, 5, 6, 7]]

flat_list = []

for sublist in my_list:

    for num in sublist:

        flat_list.append(num)


print(flat_list)


#3

my_list = [[1], [2, 3], [4, 5, 6, 7]]

flat_list = sum(my_list, [])
print(flat_list)


Comments