Statement: (Multilabel Classification) A tag is a word or phrase that describes the topic of a question. Every question must have at least one tag and can have up to five. Users can choose tags from the list already available on the site or, if their reputation is above 1500, create new ones. Tags help experts find the questions they can answer, and they help users find questions that are relevant or interesting to them. Because only users with a good reputation can add new tags, most users cannot suggest new ones.
Because the number of tags is huge, manually searching for the right ones while posting a question is cumbersome. An auto-tagging system that suggests tags based on the content of the question would help.
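To make the idea of an auto-tagging system concrete, here is a minimal sketch of a baseline suggester. It is not the method used in the course: it simply counts how often each word co-occurs with each tag and returns the highest-scoring tags, capped at five as the statement requires. The function names (`tokenize`, `train`, `suggest_tags`) and the toy questions are hypothetical and exist only for illustration.

```python
import re
from collections import Counter, defaultdict

MAX_TAGS = 5  # the statement allows at most five tags per question


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9+#]+", text.lower())


def train(questions):
    """Build a word -> Counter(tag) table from (text, tags) pairs."""
    word_tag_counts = defaultdict(Counter)
    for text, tags in questions:
        for word in set(tokenize(text)):
            word_tag_counts[word].update(tags)
    return word_tag_counts


def suggest_tags(word_tag_counts, text, max_tags=MAX_TAGS):
    """Score each tag by summing co-occurrence counts over the words in text."""
    scores = Counter()
    for word in set(tokenize(text)):
        scores.update(word_tag_counts.get(word, {}))
    return [tag for tag, _ in scores.most_common(max_tags)]


# Toy data, invented for this example only.
toy_questions = [
    ("How do I reverse a list in Python?", ["python", "list"]),
    ("Python dictionary comprehension syntax", ["python", "dictionary"]),
    ("Segmentation fault when freeing a pointer in C", ["c", "pointers"]),
]

model = train(toy_questions)
suggestions = suggest_tags(model, "Sorting a list in Python")
print(suggestions)
```

Each question can receive several tags at once, which is what makes this a multilabel problem rather than ordinary single-label classification. A real system would also need to down-weight common words such as "a" and "in", which this sketch does not do.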
Data Type: CSV files
train.csv (Id, title, body, tags)
Test.csv (id, title, body)
Data Size: 10GB
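At this size the training file is unlikely to fit in memory on a typical machine, so a first look at the data is easier if rows are streamed one at a time. The sketch below counts tag frequencies that way using Python's standard `csv` module. It assumes the column names listed above and, as a further assumption not stated in the description, that the `tags` field holds space-separated tags. The helper name `count_tags` and the two sample rows are hypothetical.

```python
import csv
import io
from collections import Counter


def count_tags(csv_file, tags_column="tags"):
    """Stream rows one at a time and count how often each tag appears."""
    counts = Counter()
    for row in csv.DictReader(csv_file):
        # Assumption: tags are separated by whitespace within the field.
        counts.update(row[tags_column].split())
    return counts


# Small in-memory stand-in for train.csv, invented for this example.
sample = io.StringIO(
    "Id,title,body,tags\n"
    '1,"Reverse a list","How do I reverse a list?","python list"\n'
    '2,"Dict syntax","What is a dict comprehension?","python dictionary"\n'
)

tag_counts = count_tags(sample)
print(tag_counts.most_common(3))
```

For the real file, the same function could be called on `open("train.csv", newline="", encoding="utf-8")`. Only the running counts are held in memory, not the rows themselves.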
We are building our course content and teaching methodology to cater to the needs of students at various levels of expertise and with varying background skills. This course can be taken by anyone with a working knowledge of a modern programming language such as C, C++, Java, or Python. We expect the average student to spend at least 5 hours a week over a 6-month period, amounting to 145+ hours of effort. The more the effort, the better the results. Here is a list of customers who would benefit from our course:
Undergrad (BS/BTech/BE) students in engineering and science.
Grad (MS/MTech/ME/MCA) students in engineering and science.
Working professionals: Software engineers, Business analysts, Product managers, Program managers, Managers, Startup teams building ML products/services.