Search Sources Guide¶
Comprehensive guide to searching each academic database
Table of Contents¶
PubMed¶
Overview¶
- Coverage: 35+ million citations
- Focus: Biomedical and life sciences
- Provider: U.S. National Library of Medicine
- Best For: Clinical medicine, biology, health sciences
Basic Search¶
Advanced Search Techniques¶
1. MeSH Terms (Medical Subject Headings)
2. Field-Specific Search
# Title field
lixplore -P -q "CRISPR[Title]" -m 20
# Title/Abstract
lixplore -P -q "gene therapy[Title/Abstract]" -m 30
# Author field
lixplore -P -q "Smith J[Author] AND cancer[Title]" -m 15
3. Publication Type Filters
lixplore -P -q "diabetes AND systematic review[pt]" -m 20
lixplore -P -q "COVID-19 AND randomized controlled trial[pt]" -m 30
Best Practices¶
DO: - Use MeSH terms for precision - Combine with date filters for recent research - Search author publications - Use field tags [Title], [Author], [Journal]
DON'T: - Expect full-text availability for all articles - Use for non-biomedical topics - Ignore publication type filters
Common Workflows¶
Clinical Research:
lixplore -P -q "hypertension AND treatment[Title/Abstract]" \
-d 2020-01-01 2024-12-31 \
-m 50 \
--sort newest \
-X enw
Author Publications:
Systematic Review Preparation:
Crossref¶
Overview¶
- Coverage: 140+ million records
- Focus: All academic disciplines with DOIs
- Provider: Crossref (DOI registration agency)
- Best For: Multi-disciplinary research, citation metadata
Basic Search¶
Advanced Techniques¶
1. DOI Lookup
2. Journal-Focused Search
3. Citation Metadata
Best Practices¶
DO: - Use for DOI lookup - Combine with enrichment for complete metadata - Search across all disciplines - Export to BibTeX for LaTeX
DON'T: - Expect full-text access - Use for preprint-only searches - Rely on abstract availability
Common Workflows¶
Bibliography Building:
Multi-Disciplinary Review:
lixplore -C -q "artificial intelligence healthcare" \
-m 200 \
--sort newest \
-S first:50 \
-X xlsx
DOAJ¶
Overview¶
- Coverage: 19,000+ journals, 8+ million articles
- Focus: Open access peer-reviewed content
- Provider: DOAJ (community-driven)
- Best For: Free full-text access, open science
Basic Search¶
Advanced Techniques¶
1. Open Access PDFs
2. PDF Link Display
3. Multi-Format Export
Best Practices¶
DO: - Use for freely accessible articles - Download PDFs directly - Combine with other sources (deduplicate) - Check for open access versions
DON'T: - Expect comprehensive coverage - Use as only source for lit reviews - Assume latest articles indexed immediately
Common Workflows¶
Open Access Literature Review:
PDF Collection:
lixplore -J -q "climate change" -m 30 --show-pdf-links
# Click links or:
lixplore -J -q "climate change" -m 30 --download-pdf
EuropePMC¶
Overview¶
- Coverage: 42+ million records
- Focus: Biomedical, life sciences (European)
- Provider: EMBL-EBI
- Best For: European research, open access content
Basic Search¶
Advanced Techniques¶
1. Grant-Funded Research
2. Open Access Filter
Best Practices¶
DO: - Use alongside PubMed for biomedical - Check for open access content - Use deduplication with PubMed
DON'T: - Use for non-biomedical topics - Expect different results from PubMed always - Skip deduplication
Common Workflows¶
Biomedical Research (EU Focus):
arXiv¶
Overview¶
- Coverage: 2.3+ million preprints
- Focus: Physics, math, CS, quantitative fields
- Provider: Cornell University
- Best For: Latest research, preprints, open access PDFs
Basic Search¶
Advanced Techniques¶
1. Latest Research
2. PDF Downloads
3. Clickable PDF Links
4. Author Tracking
Best Practices¶
DO: - Use for cutting-edge CS/physics research - Download PDFs (always available) - Track specific authors - Sort by newest for latest papers
DON'T: - Expect peer review - Use for medical/clinical research - Trust all methodologies blindly
Common Workflows¶
Stay Current in CS:
Author Monitoring:
PDF Collection:
Multi-Source Strategies¶
All Sources Search¶
Advantages: - Comprehensive coverage - Cross-disciplinary results - Maximum recall
Disadvantages: - Slower (5x API calls) - Many duplicates (use -D!) - Can be overwhelming
Custom Combinations¶
Biomedical (PubMed + EuropePMC):
Computer Science (arXiv + Crossref):
Open Access Only (DOAJ + arXiv):
Peer-Reviewed Recent (PubMed + Crossref):
Deduplication Strategy¶
Essential for Multi-Source:
Choose Strategy:
# Strict (bibliography)
lixplore -A -q "query" -m 100 -D strict
# Loose (discovery)
lixplore -A -q "query" -m 200 -D loose
# Auto (balanced)
lixplore -A -q "query" -m 150 -D
Source Selection Decision Tree¶
Need biomedical/clinical research?
├─ Yes → PubMed or EuropePMC
└─ No → Continue
Need latest CS/physics preprints?
├─ Yes → arXiv
└─ No → Continue
Need open access PDFs?
├─ Yes → DOAJ or arXiv
└─ No → Continue
Need multi-disciplinary?
├─ Yes → Crossref or All sources
└─ No → Continue
Need comprehensive coverage?
└─ Use All sources (-A) with deduplication (-D)
Performance Comparison¶
| Aspect | PubMed | Crossref | DOAJ | EuropePMC | arXiv |
|---|---|---|---|---|---|
| Speed | Fast | Fast | Medium | Medium | Fast |
| Coverage | ★★★★☆ | ★★★★★ | ★★★☆☆ | ★★★★☆ | ★★★☆☆ |
| Full Text | Some | No | Yes | Some | Yes |
| Metadata | ★★★★★ | ★★★★★ | ★★★☆☆ | ★★★★☆ | ★★★☆☆ |
Last Updated: 2024-12-28