I would like to work on a project for Fake News Detection especially for Indians news which are in different languages and different formats.
- Fake news as image with no or very less text
- Fake news on a blog site
- Fake news as Tweets
- Fake news in Hindi
- Fake news in the watsapp group and shared across.
Need your help on the approach. One approach I can think of is using OCR we can read the content of the post, then search those content in the google. If the news is not present in any of the famous print media then we can tag it as fake. However there can be many challenges in this. What if the print media itself gives any fake news shared by someone.
How to handle the scenario where there is no text in the image but the information shown as image is fake.
How to handle posts written in Hindi. ?
And even if we detect fake news, is there any way to make the person accountable for sharing it. ? I know it is little difficult problem to solve. But is there any work currently done by any company on this. ? Any starting point for me to get into this domain ?