I need to join two RDDs as part of my programming assignment. The problem is that the first RDD is nested, while the other is flat. I tried different things, but nothing seemed to work. Is there any expert on PySpark that can help me?
First RDD is:
[(('brand', 1), ('queen', 1), ('elizabeth', 1), ...),
(('50', 1), ('worst', 1), ('habit', 2), ...),
(('cost', 1), ('trump', 1), ('aid', 1), ..., ('hole', 1))]
Second RDD is:
[('brand', 1), ('queen', 3), ('elizabeth', 2), ...]