CÔNG TY CỔ PHẦN WEBIFY GROUP
Data Crawler Staff (Data Collection Staff)
Views: 966, Approved: 11/02/2025
Date posted
11/02/2025
Number of openings
1
Position
Staff
Employment type
Full-time, permanent
Gender requirement
None
Experience
1 year
Degree
Bachelor's degree
Language
English
Industry
Information technology
Job description
1. Professional Scraping System Development
- Design cross-platform Python crawling scripts
- Build scalable systems
- Develop parallel crawling solutions
- Manage large, multi-threaded data streams
Technologies:
- Scrapy, BeautifulSoup
- Selenium
- Asyncio, Multiprocessing
- Proxy management
- IP rotation techniques
2. Data Processing and Normalization
Processing methods:
- Develop API data cleaning processes
- Data transformation algorithms
- Integrity checks
- Remove noisy data
Tools:
- Pandas
- Data validation techniques
- Machine Learning preprocessing
3. Database Management
Advanced SQL:
- Complex queries
- Performance optimization
4. Monitoring & Optimization
Strategy:
- Manage scraping system operations
- Track scraping performance
Challenge handling:
- IP blocking
- Rate limiting
- CAPTCHA
Benefits
- Full social insurance, health insurance, labor contract, vacation days, and other benefits according to state regulations
- Parking allowance
- Regular annual salary increase
- Training and skills development to meet job requirements and support the promotion path
- Participation in courses when necessary
- Weekly/monthly/quarterly/yearly bonuses and project bonuses
- Holiday/Tet bonuses
- Young, friendly, and dynamic working environment
- Company trip: once a year
Job requirements
- Bachelor's degree (GPA > 3.0)
- Major: Data science, computer engineering, or data-related fields
- English: TOEIC > 700 or IELTS > 5.5
- Technical Skills
- Python Ecosystem
- Asyncio, Multiprocessing
- Data cleaning techniques
- Machine Learning preprocessing
- Advanced error handling
- Database & Big Data
- SQL (Intermediate to Advanced)
- NoSQL database management
- PySpark
- Data warehousing
- In-depth Experience
- Minimum 1-2 years
- Project implementation
- Web scraping
- Automatic data processing
- Big data crawling
- System analysis
- Problem solving
- Independent work and teamwork
- Time management
- Logical thinking
- Nice-to-have experience
- Big Data experience
- Data pipeline design
- Working with diverse APIs
- Professional certifications
- Creativity and initiative in proposing ideas
Application documents
CV
Work location
19 Hồ Văn Huê, Ward 9, Phú Nhuận District, Ho Chi Minh City
CÔNG TY CỔ PHẦN WEBIFY GROUP
Address:
19 Hồ Văn Huê, Ward 9, Phú Nhuận District, Ho Chi Minh City
Company size:
10 - 30 employees