Let’s Work Together
Internship: OSINT & Text Analysis
Internship Structure
Open Positions: 1
Duration: 12 weeks.
Time Commitment: Minimum 5 hours per week.
Compensation: Unpaid.
Start Date: Target April; flexible depending on candidate availability.
Supervision: Reports directly to the two Lead Analysts.
Evaluation: Based on successful delivery of functional, documented code and assigned analytical outputs.
Application Requirements: CV and short statement of interest, including GitHub profile or demonstrable technical work.
Location: Remote; in-person meetings possible in New York City.
Only shortlisted candidates will be contacted for an interview.
About GéoPoly Global
GéoPoly Global is a New York–based policy advisory startup focused on structured analysis of geopolitical dynamics, policy risk, and strategic positioning. The firm develops analytical frameworks and proprietary tools that combine political, economic, and regulatory indicators to support forward-looking assessment and decision-making.
About This Internship
The internship supports the development of an in-house policy monitoring and country-relations assessment platform. The project integrates structured datasets, regulatory signals, sanctions tracking, and narrative analysis into a unified analytical system designed to evaluate state behavior, identify historical trends, correlate shifts with leadership changes, and forecast geopolitical risk to inform client advisory.
Interns will have access to proprietary methodologies, internal rating frameworks, data architecture, and strategic design elements that are not publicly disclosed. A Non-Disclosure Agreement therefore protects the project’s structure, technical implementation, and analytical models, without restricting the intern’s use of general skills or experience gained.
The OSINT & Text Analysis Intern will lead the transformation of large volumes of publicly available policy and media content into structured analytical outputs for an ongoing in-house project. The role focuses on extracting, cleaning, categorizing, and interpreting unstructured text data from official releases, regulatory communications, and news sources, converting raw information into searchable, tagged, and analytically usable insights that support monitoring and strategic assessment.
This intern will work in close coordination with the Data Engineering Intern to ensure seamless integration between data acquisition, storage, and analysis. While the Data Engineering role manages database architecture and automated ingestion pipelines, this role builds on that infrastructure to process, structure, and interpret text-based data. Together, they ensure that raw policy and media data is reliably collected, securely stored, and transformed into searchable, analytically useful outputs.
Key Responsibilities
Perform Open Source Intelligence (OSINT) gathering from public APIs, official websites, databases, and permitted social media sources.
Develop text processing scripts to clean, normalize, and tokenize raw text data.
Apply Natural Language Processing (NLP) techniques such as Sentiment Analysis, Named Entity Recognition (NER), and text classification to extract structured insights.
Design tagging taxonomies and entity frameworks to ensure consistent categorization of actors, policies, and themes.
Validate and evaluate NLP outputs to reduce false positives and improve analytical reliability.
Document data sources, assumptions, and methodologies to ensure reproducibility and auditability.
Preferred Previous Experience
Prior experience conducting structured research or OSINT projects involving large volumes of text data.
Experience developing NLP, text classification, or data processing scripts in Python within research or applied settings.
Strongly preferred: experience analyzing policy documents, regulatory communications, geopolitical reporting, or public institutional data.
Required Skills
Advanced Python, including Pandas for data manipulation and preprocessing.
Familiarity with NLP libraries such as spaCy, NLTK, or Hugging Face Transformers.
Experience with web scraping tools (e.g., BeautifulSoup or Scrapy) and a clear understanding of robots.txt and data ethics.
Understanding of API-based data collection.
Strong awareness of legal and ethical considerations in OSINT and data usage.
Ability to structure unorganized text into consistent, queryable formats.
Other Requirements
Currently enrolled in a university program. Preference for Master’s students; strong upper-level undergraduate students (third or fourth year/senior standing) will also be considered.
Eligible fields include Data Science, Computer Science, Engineering, Information Systems, or related quantitative disciplines. Students in Public Policy, International Affairs, Political Science, or similar fields will also be considered if they demonstrate substantial coursework or applied experience in data analysis, programming, or quantitative methods.
Willingness to sign a Non-Disclosure Agreement governed by New York State law, covering project content, methodologies, and proprietary materials (not personal skills or experience gained).
Must be located in the United States and authorized to work without requiring visa sponsorship, CPT, or OPT.
Strong English proficiency required; additional languages are an asset.
What the Intern Will Gain
Direct, hands-on experience building and operating real-world data and analytical systems within an active geopolitical advisory project.
Exposure to structured policy monitoring, data engineering workflows, and applied analytical methodologies.
Opportunity to translate academic knowledge into practical execution and produce tangible outputs.
Academic credit may be possible where applicable, but it is the intern’s responsibility to confirm eligibility and secure approval from their institution.
High-performing interns may be invited to remain involved in the startup’s analyst network and contribute to the continued development and growth of GéoPoly Global.
Learning Outcomes
Transform large volumes of unstructured policy and media text into structured, queryable analytical datasets using NLP techniques.
Develop and validate classification, entity extraction, and sentiment models with measurable performance standards.
Build reproducible OSINT workflows that integrate ethical data collection, documentation, and analytical rigor into decision-support outputs.
GéoPoly Global is an equal opportunity organization. We evaluate applicants based on merit, skills, and experience, without regard to race, color, religion, sex, national origin, age, disability, gender identity, sexual orientation, or any other protected status.