This role focuses on one core objective: identifying, extracting, and delivering high-quality, verified prospect data for doctors’ offices, clinics, and healthcare practices across the United States.
The candidate will scrape, mine, and compile accurate lead data from public directories, government healthcare databases, medical platforms, and business listing sources. A strong understanding of the US healthcare ecosystem is essential, including how medical practices are structured and where decision-makers can be identified.
This is not a basic data entry position. We are looking for someone who understands lead quality, can identify key decision-makers within healthcare organizations, and can build structured, conversion-ready prospect lists for the sales team.
The ideal candidate must also be highly proficient in Microsoft Excel and capable of managing large-scale datasets accurately, consistently, and efficiently.
Design and execute large-scale data extraction workflows to collect high-quality prospect data from multiple online sources, including public directories, business listing platforms, regulatory databases, and industry-specific portals.
Build structured scraping processes to extract company profiles, contact details, decision-maker information, and operational data across multiple industries.
Utilize structured and unstructured data sources including websites, government databases, APIs (where available), and publicly accessible registries to ensure comprehensive lead coverage.
Identify and extract key stakeholders within organizations such as owners, founders, directors, managers, or relevant decision-makers along with verified contact information (email, phone, LinkedIn where applicable).
Develop logic-based scraping approaches to segment high-quality leads based on business signals such as activity level, reviews, operational scale, hiring signals, and service demand indicators.
Clean, normalize, and de-duplicate large datasets to ensure accuracy, consistency, and usability of scraped data.
Structure all extracted data into standardized formats for CRM and sales use, including fields such as: Company Name, Industry, Contact Person, Role, Email, Phone, Location, Source URL, Lead Score, and Notes.
Maintain data integrity through validation checks, format standardization, and periodic database auditing.
Handle large-scale datasets efficiently using advanced Excel and/or data processing tools.
Enhance raw scraped data by enriching missing fields using additional sources and enrichment tools.
Identify high-value prospects using intent signals, business growth indicators, and digital footprint analysis.
Categorize and prioritize leads based on quality scoring models aligned with sales conversion potential.
Continuously discover and integrate new data sources, directories, and scraping opportunities to improve lead volume and accuracy.
3–5 years of experience in lead generation, data scraping, or B2B data research roles.
Strong hands-on experience with web scraping tools or scripting (Python, Selenium, BeautifulSoup, or similar).
Familiarity with CRM platforms such as HubSpot, Salesforce, or Zoho.
Experience with lead enrichment and verification tools such as Hunter.io, Apollo.io, ZoomInfo, Yelp, or Lusha.
Working knowledge of LinkedIn Sales Navigator for identifying decision-makers.
Basic understanding of healthcare operations, billing workflows, and administrative structures in medical practices.
Prior exposure to healthcare-focused sales or lead generation environments (preferred).