As a Data Analyst at CrowdStrike, you will focus on data and corpus labeling, as well as other data-related tasks critical to supporting our large language models (LLMs) and cybersecurity initiatives. This role is crucial to enhancing our products' capabilities by ensuring the accuracy and quality of the data used to train models and detect threats, thereby supporting the overall mission of the Generative AI Research Center.
Responsibilities
- Label and annotate cybersecurity-related datasets to prepare them for analysis and machine learning tasks
- Ensure labeling accuracy and consistency across different datasets
- Gather data from various cybersecurity sources
- Clean and preprocess data to make it suitable for analysis and modeling
- Perform exploratory data analysis to uncover patterns, trends, and insights related to cybersecurity threats and vulnerabilities
- Utilize statistical methods and tools to interpret data and identify potential security issues
- Create and maintain dashboards and reports to communicate findings to cybersecurity stakeholders
- Develop visualizations to present data in a clear and concise manner
- Work closely with analysts, data scientists, engineers, and other team members to support their data needs
- Support the implementation and optimization of MLOps pipelines
- Participate in team meetings and contribute to project planning and discussions
- Document processes, methodologies, and insights gained from data analysis and labeling activities
- Maintain clear records of data sources, cleaning steps, and labeling criteria
Benefits
- Market leader in compensation and equity awards
- Comprehensive physical and mental wellness programs
- Competitive vacation and holidays to recharge
- Paid parental and adoption leave
- Professional development opportunities for all employees
- Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
- Vibrant office culture with world-class amenities
- Great Place to Work Certified™ across the globe