A complete machine learning project focused on building and optimizing an SMS spam detector. The process begins with loading and cleaning text data, followed by feature extraction using TF-IDF vectorization. Several models are trained, including Multinomial Naive Bayes and Logistic Regression. Advanced techniques are employed to boost performance, such as hyperparameter tuning with GridSearchCV, handling class imbalance with SMOTE, and feature engineering by incorporating message length. The final trained model and vectorizer are saved using joblib and deployed as a user-friendly, interactive web application with Streamlit.
