How to integrate NLTK with Hadoop HDFS?
I have a working sentiment analysis program using NLTK that reads text from a .txt file placed on my local machine. I want to read a .txt file placed in Hadoop HDFS and perform the same sentiment analysis.
How can I achieve this?
Any pointers on this topic are appreciated!
So this won't be possible unless NLTK can recognize HDFS. Some programs, like NLTK, allow you to pass data directly to the program. Assuming that is the case, you can use what I suggest in the other answer, "How to run external program within mapper or reducer giving HDFS files as input and storing output files in HDFS?". You write a small Java adapter that opens an input stream on the HDFS file and passes it to the program you want to run.
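As a rough illustration of the "pass the data directly to the program" idea, here is a minimal Python sketch. It swaps the Java adapter for a plain stream: the scoring code accepts any iterable of lines, so it does not care whether the text came from a local file or from HDFS. The names `score_lines` and `toy_score` are hypothetical, and `toy_score` is only a stand-in for whatever NLTK-based scorer the existing program already uses.

```python
import io
from typing import Callable, Iterable, List, Tuple


def score_lines(
    lines: Iterable[str], score: Callable[[str], float]
) -> List[Tuple[str, float]]:
    """Score every non-empty line from any text stream (a file, sys.stdin, ...)."""
    results = []
    for raw in lines:
        text = raw.strip()
        if text:  # skip blank lines
            results.append((text, score(text)))
    return results


def toy_score(text: str) -> float:
    """Hypothetical placeholder; replace with the real NLTK-based scorer."""
    positive = {"good", "great", "love"}
    negative = {"bad", "awful", "hate"}
    words = text.lower().split()
    return float(sum(w in positive for w in words) - sum(w in negative for w in words))


# Any file-like object works; io.StringIO stands in for the HDFS stream here.
sample = io.StringIO("good day\n\nbad awful\n")
results = score_lines(sample, toy_score)
```

If the script calls `score_lines(sys.stdin, ...)`, the HDFS file can probably be streamed into it with something like `hdfs dfs -cat <hdfs path> | python <your script>`, assuming the `hdfs` command-line client is available on that machine.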
If that sounds like too much trouble, or isn't possible for some reason in your case, you can use the HDFS tools to copy the file to a local path and read it from there.
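A minimal sketch of that fallback, assuming the `hdfs` command-line client is installed and on the PATH. It shells out to `hdfs dfs -get` to copy the file locally, after which the existing NLTK program can open it as before. The function names and both paths are hypothetical.

```python
import subprocess
from typing import List


def build_get_command(hdfs_path: str, local_path: str) -> List[str]:
    """Build the `hdfs dfs -get` command that copies an HDFS file to local disk."""
    return ["hdfs", "dfs", "-get", hdfs_path, local_path]


def fetch_from_hdfs(hdfs_path: str, local_path: str) -> str:
    """Copy the file out of HDFS and return the local path.

    Raises subprocess.CalledProcessError if the hdfs command fails.
    """
    subprocess.run(build_get_command(hdfs_path, local_path), check=True)
    return local_path


# Hypothetical paths, shown only to illustrate the command that would run.
command = build_get_command("/data/reviews.txt", "/tmp/reviews.txt")
```

Calling `fetch_from_hdfs("/data/reviews.txt", "/tmp/reviews.txt")` and then passing the returned path to the existing program keeps the NLTK code unchanged, at the cost of an extra copy of the data on local disk.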