Blog Posts

Week 5: Advancing Integration

This week marked significant strides in our Verifone project. Each team delivered key accomplishments, moving us closer to our goals. The collaboration between the UI, Feature Engineering (FE), and Machine Learning (ML) teams demonstrated remarkable synergy as we tackled core components and refined functionalities across the board.

The UI Team reached a significant milestone by completing the site selection of core visual components and their integration. While site-specific statistics remain in progress, clustering sites on the map has been implemented, providing a more intuitive way to visualize the data. Additionally, the search bar is fully functional, seamlessly integrated with the map, and includes improved statistics details to enhance usability.

User Interface – Clustering

The Feature Engineering Team shifted its focus to feature correlation analysis, which will help us identify and limit feature redundancy. The team also made significant headway in building the pipeline to simulate multi-site data live streams, ensuring a reliable feed into the broader data processing pipeline.

On the Machine Learning Team, spatial models saw a breakthrough with the successful implementation of Gaussian Mixture Models (GMM) and DBSCAN. The team also fixed the long-term memory issues in the CNN-LSTM model and completed the ensemble pipeline. Synthetic data testing and debugging were conducted to validate these models, paving the way for more accurate predictions and insights.

As we prepare for the coming week, each team has outlined clear goals to maintain momentum. The UI team will finalize site-specific statistics and explore implementing filters and dynamic cluster/marker colors based on volume. The Feature Engineering team will wrap up feature correlation analysis and work on reformatting to reduce preprocess runtimes. Meanwhile, the ML team plans to isolate and benchmark different ensemble models, fine-tune atomic models, and adjust ensemble weights.

We are set for another productive week with consistent progress across all teams. Let’s keep up the excellent work! All teams, we are set for another productive week. Let’s keep up the excellent work!

Week 4: Integration and Model Progress

This week marked significant progress for our Verifone project, with each subteam making meaningful strides toward our goals. The UI team completed the core visual components of the main screen, providing a solid foundation for displaying key data. They also began backend integration, including mapping site displays, statistics visualization, and connecting model outputs with the UI. These efforts bring us closer to a cohesive and functional user interface.

Updated User Interface

The Feature Engineering (FE) team focused on preparing JSON data files for batch processing, a crucial step for handling larger datasets efficiently. They explored new ideas for feature engineering other event types, broadening the scope of our model’s potential applications. Additionally, they worked on optimizing the preprocessing pipeline for model training and prediction, ensuring smoother integration across the project.

The Machine Learning (ML) team made significant advancements this week, starting with extracting anomalies from new data to present actionable results for the UI. They also generated synthetic data to test and validate their models, calculating baseline metrics such as F1 scores for the ensemble design. Furthermore, prototypes of classification models, including linear and logistic regression, were completed, establishing a framework for the next stage of development.

Next week, each subteam has outlined clear objectives to build on this week’s momentum. The UI team will complete the visual components for the site popup screen, refine statistics displays for the primary and popup screens, and enhance functionality by improving the search bar and filters. The FE team will focus on implementing a caching system to track PREAUTH sequences and redesigning the build sequence function to handle multiple sites. The ML team plans to explore unsupervised learning models such as DBSCAN and GMM, finalize the ensemble design, and fine-tune each atomic model and ensemble weights to optimize performance metrics.

Statistics Display

With steady progress across subteams and clearly defined plans, we are well to achieving our following milestones. See you next week with more updates and accomplishments!

Week 3: Progress Update

This week, our team made significant progress across all fronts, refining our pipeline and strengthening our model’s capabilities. As a team, we successfully presented QRB 1 and began preparing slides for QRB 2 and the FDR documentation. Meanwhile, work continued programming the UI frontend, ensuring a seamless and user-friendly experience.

The UI team took a significant step forward by initiating the programming pipeline, successfully automating and integrating it with the model output. This achievement streamlines the process, making it more efficient and scalable. On the feature engineering (FE) side, the team reworked the CHAMPS preprocessing scripts, optimizing them for smoother data handling. Additionally, they refined the training process by allowing more flexibility in using feature engineering columns, ensuring more adaptable model training.

User Interface

The machine learning (ML) team also made substantial advancements, introducing an ensemble anomaly matrix and developing an ensemble batch model training script. These enhancements improve anomaly detection by leveraging multiple models for more robust predictions. Another key milestone was the automation of synthetic data injection, making it easier to test and validate models under controlled conditions. The team also formatted the output for anomaly detection, ensuring consistency and clarity in results.

Further strengthening our dataset, the FE team incorporated additional spatial features, including region segmentation, population size, gender ratio, and median household income. These enhancements provide more contextual data, helping the model make more informed predictions.

Our project is becoming increasingly refined and automated with these developments, bringing us closer to a fully functional and efficient anomaly detection system. We’re looking forward to continuing this momentum in the coming weeks. See you next time!

Week 2: Building Momentum

This week, Team INSIGHT continued making strides in our Verifone project, collaborating to complete the required QRB documents. These efforts ensured that we were aligned with Verifone’s expectations and prepared for the subsequent phases of development. Each subteam made notable progress, laying a strong foundation for upcoming tasks.

The UI team focused on designing the whole pipeline’s backend architecture, incorporating Verifone’s suggestions. This design work is critical to ensuring a seamless integration of the front and back ends. The Feature Engineering (FE) team implemented z-score calculations for duration and amount using a rolling window, experimenting with different designs to optimize runtime. They also made significant progress in spatial feature engineering, extracting key geographical attributes such as city, state, and zip code. The Machine Learning (ML) team advanced their work on isolation forests, seq2seq models, and Optuna integration, further refining our approach to anomaly detection.

Looking ahead, the subteams have ambitious plans for next week. The UI team will continue Angular development, focusing on map integration and clustering research while finalizing anomaly and statistics scripts for UI implementation. The FE team aims to complete their proposed spatial features and optimize the preprocessing pipeline in collaboration with the ML team. The ML team will work on formatting JSON outputs for backend integration, ensuring that anomaly detection results are effectively communicated.

Next Tasks

As we reflect on this week, it’s clear that collaboration and communication remain the keys to our success. We are excited to build on this momentum as we move closer to delivering a polished and impactful solution.

Week 1: Semester 2

As we embark on a new semester, the excitement within Team INSIGHT is palpable. After a refreshing winter break, we are all rejuvenated and ready to dive back into the Verifone project with fresh energy and ideas. This week, we reconnected with one another and met with our coach to align our goals and plans for the semester. It was an excellent opportunity to reflect on past progress and set the stage for the exciting work ahead.

Our team hit the ground running this week, achieving significant milestones across all subteams. The UI team began developing the user interface, familiarizing themselves with Angular’s development cycle, and setting up the backend for the site. This included configuring the Mongo Atlas database and testing HTTP methods, marking a strong start toward building a robust and dynamic platform.

User Interface

The Feature Engineering (FE) team made notable strides in data preprocessing by implementing cyclical and binary encoding methods. These refinements enhanced the timeline for data preparation, ensuring smoother downstream processes.

The Machine Learning (ML) team also had a productive start, refining the prediction script and enhancing the model auto-search functionality. These improvements lay the groundwork for more accurate anomaly detection and efficient model integration moving forward.

Looking ahead, each subteam has a focused plan to build on this momentum. The UI team will continue their Angular development efforts, emphasizing integrating a map and researching clustering methods. They will also finalize anomaly and statistics scripts for UI implementation. The FE team plans to complete the proposed features, including normal, z-score, and spatial grouping. In contrast, the ML team will work on formatting JSON outputs and integrating Isolation Forest and Optuna for improved anomaly detection.

Critical Path

We are thrilled about the progress made in this first week and eager to continue building on it in the coming weeks. Here’s to a semester filled with innovation, collaboration, and success!

Goodbye for now, and see you next week!

Week 13: Wrapping Up the Semester

This week marked an important milestone as we presented our System Level Design Review (SLDR) to our coach and liaisons. The presentation was a culmination of our hard work this semester, showcasing our progress, design decisions, and plans for the next phases of the project. Receiving their feedback was invaluable and will guide our next steps as we refine our design and approach.

INSIGHT – SLRD Presentation

In addition to the SLDR presentation, we had our individual evaluation meetings with our coach. These sessions allowed us to reflect on our contributions to the project, identify areas for growth, and set goals for the upcoming semester. The feedback from these evaluations was constructive and will help us return in the spring with renewed focus and determination.

As this is the final blog of the semester, it’s a great moment to reflect on everything we’ve accomplished as a team. From onboarding, exploratory data analysis, and developing initial prototypes to presenting our SLDR, we’ve laid a strong foundation for the work ahead.

We’re looking forward to continuing this momentum in the spring semester, where we will dive deeper into implementation, testing, and finalizing our deliverables. Until then, happy holidays, and see you in the new year!

Week 12: Progressing Through System-Level Refinements

This week, our team focused on finalizing and presenting the System Level Design Review (SLDR). We worked collaboratively to prepare a comprehensive presentation that showcased our progress and outlined key components of our system. The presentation provided an excellent opportunity to receive feedback, which we documented in a Feedback Memo to guide our next steps. Alongside the presentation, we also revised and refined the SLDR document, ensuring it accurately reflected our current state and plans moving forward.

The Feature Engineering team achieved significant progress by implementing two spatial feature engineering techniques. They explored geospatial grid binning to analyze transaction data within specific geographic regions and began integrating weather data to provide contextual insights based on location. These additions represent a promising direction for enhancing the model’s ability to detect anomalies.

Meanwhile, the UI team revisited and improved several aspects of the interface based on feedback from the PID Day presentation. The updates aimed to create a more intuitive and user-friendly design, ensuring that the dashboard aligns with user needs and expectations. These changes brought us closer to a polished interface that meets the project’s goals.

Updated UI

Next week, the subteams will tackle focused objectives. The Feature Engineering team plans to finalize spatial feature engineering and experiment with clustering techniques to uncover patterns in transaction behavior. The Model Engineering team will refactor the model code to enhance performance and maintainability. The UI team will continue refining the design based on user testing feedback and begin implementing the interface in Angular for improved scalability and responsiveness.

As we wrap up Week 12, we feel proud of the progress we’ve made and the teamwork that has driven these achievements. The journey so far has been a rewarding one, and we look forward to tackling the challenges ahead. Thank you to everyone for your dedication and hard work—here’s to a strong finish in the weeks to come!

Week 11: Preparing for Prototype Inspection Day

This week, our team, INSIGHT, made significant strides across multiple fronts of the Verifone project. We collaborated to prepare for and deliver our Prototype Inspection Day presentation, showcasing our progress and receiving valuable feedback to guide future work. Additionally, we began developing the System Level Design Review (SLDR) document, a critical step in refining our overall approach and ensuring alignment across subteams.

The Feature Engineering team focused on debugging issues in the previous exploratory data analysis (EDA) and data transformation process. They also comprehensively reviewed site-level performance, identifying opportunities for implementing Spatial Feature Engineering. The Model Engineering team addressed critical technical challenges by resolving graphics card issues with HiPerGator and debugging their processes to improve model recall. Meanwhile, the UI team conducted user testing of the prototype, gathering insights to enhance functionality and usability for PI Day.

Looking ahead, each subteam has defined clear objectives for the upcoming week. The Feature Engineering team will focus on implementing Spatial Feature Engineering, evaluating the importance of features in removing redundancies and comparing the performance gains from these additional features. The Model Engineering team aims to improve the model’s baseline performance, emphasizing recall optimization. The UI team plans to iterate on their designs, incorporating feedback from user testing to improve the interface. Our liaison engineers will continue reviewing the potential of proposed features, providing critical feedback to ensure they align with the overarching project goals.

As we progress, we remain committed to our shared objectives and look forward to delivering a polished and impactful project. Thank you for following our progress, and stay tuned for more updates next week!

Week 10: Data Analysis and Model Debugging

This week, our team continued to make significant strides on the Verifone project as we explored and prepped the new data set we received. Ensuring that all data was in the correct format and safe for processing was a priority, and we worked closely as a team to verify these initial steps. This collective effort laid a solid foundation for further analysis and model improvements.

The Feature Engineering team took the lead on Exploratory Data Analysis (EDA) of the new data, focusing on Temporal Engineering techniques. We implemented a 20-minute interval calculation to capture evolving patterns within the data, aiming to reveal trends and outliers that could enhance model accuracy. These initial EDA steps pave the way for more advanced feature extraction, including spatial features, which we plan to explore further in the coming weeks.

Data Analysis – Events per Hour

Meanwhile, the Model Engineering team debugged the HiPerGator model to ensure it integrates smoothly with the new data. We also began analyzing distinct datasets within Jupyter notebooks to understand specific patterns from different locations. This detailed examination will guide our model adaptations and help us benchmark its performance with the new data.

On the UI side, the team focuses on refining the user interface and preparing for user testing sessions scheduled for PI (Product Increment) Day. These sessions will provide critical insights into the design’s usability, ensuring it meets user needs and project goals.

Looking ahead to next week, we plan to come together to finalize preparations for PI Day, aiming to make a strong impression with our data insights and refined model. We’ll also begin drafting the Prototype Inspection and System Level Design Review documents to outline our progress and next steps. The Feature Engineering team will dive deeper into additional EDA methods, and the Model Engineering team will benchmark our current model with the new data to evaluate its performance. The UI team will continue refining and testing the interface based on user feedback.

It’s been a rewarding week with valuable advancements. We’re excited about the progress we’re making and look forward to further accomplishments. See you next week!

Week 9: Exploring New Data and Spooky Progress

INSIGHT Team Halloween: Celebrating teamwork and spooky vibes as we dive into data! 🎃👻

This Halloween week, our team dived into exploring a fresh batch of data we received, ensuring it’s in the right format and safe to open. We worked together to check data quality, structure, and format, aligning everything for smooth processing. We also considered privacy and security practices, making sure to handle the data responsibly. The Feature Engineering Team took charge of initial exploratory data analysis (EDA) and began brainstorming potential new features, working closely with our liaison engineer, who provided valuable feedback on feasibility and alignment with our project goals. This guidance helped us start shaping a more refined approach to feature engineering as we move forward.

The Model Engineering Team focused on extending our ensemble model, finishing the integration of the remaining models, and running tests with GPU acceleration to boost performance. These adjustments helped ensure that our model is optimized for speed and accuracy, setting us up to handle the new data effectively. The team was pleased to see initial test runs going smoothly, and we look forward to further fine-tuning in the coming weeks.

Looking ahead, our next steps are to incorporate the feedback from our recent PDR presentation and work together on upcoming Prototype Inspection Day and System Level Design Review documents. Subteam goals include continued EDA and implementation of feature engineering on the new dataset by the Feature Engineering Team, while the Model Engineering Team will focus on benchmarking our current model’s performance with the latest data. Additionally, our liaison engineers will continue to review our feature engineering ideas and provide critical feedback to make sure we’re aligned with the project’s bigger objectives.

That wraps up this week’s update! Have a fun and safe Halloween, and stay tuned for more progress as we continue enhancing our Verifone project. 👻