Data Crawler

Data Crawler

Rancho’s Data Crawler lets users crawl, manage and enrich data and metadata from PubMed, ClinicalTrials.gov, and other commonly used public resources. “Light Crawler” and “Deep Crawler” modes are customizable to your project’s needs to provide an efficient yet comprehensive search experience.

Light Crawler

Lets users extract valuable information from their target publications. The Light Crawler can be used for:

  • Publication identification, coverage, and metadata
  • Data parsing and cleaning
  • Easy export
  • Scalability

Deep Crawler

Designed to gather sample-level metadata from publications. The Deep Crawler can be used for:

  • Searching for a wide variety of metadata, including (but not limited to) sample identification numbers, sample dates, sample processing methods, and experimental conditions
  • Comparing results across different studies and datasets

Features

  • Access to millions of research papers, specimens, and datasets
  • Annotation Service – automatically annotates keywords in free-text fields from both metadata and sample levels against public ontologies
  • Customizable to extract data from various websites; can be used on static and dynamic web pages.
  • Can extract data in different formats, including JSON, CSV, and Excel, for easy integration into various analytical tools and platforms.