It would be better to add one more transformation step before saveAsTextFile, for example:

    rdd.map(tuple => "%s,%s,%s".format(tuple._1, tuple._2, tuple._3)).saveAsTextFile(...)

That is, manually convert each tuple to the format you want, and then write it to HDFS.

Thanks
Jerry

-----Original Message-----
From: SK [mailto:skrishna.id@gmail.com]
Sent: Wednesday, June 11, 2014 9:34 AM
To: user@spark.incubator.apache.org
Subject