-
Notifications
You must be signed in to change notification settings - Fork 87

Description
Is there an existing issue for this?
- I have searched the existing issues
Feature Description
crawler all video transcripts from Khan Academy to create a list of 'learn' words or sentences
Web scraping category
EN
- MATH: HIGH SCHOOL & COLLEGE https://www.khanacademy.org/math
- TEST PREP https://www.khanacademy.org/test-prep
- SCIENCE https://www.khanacademy.org/science
- COMPUTING https://www.khanacademy.org/computing
- ARTS & HUMANITIES https://www.khanacademy.org/humanities
- ECONOMICS https://www.khanacademy.org/economics-finance-domain
- READING & LANGUAGE ARTS https://www.khanacademy.org/ela
- LIFE SKILLS https://www.khanacademy.org/college-careers-more
- PARTNER COURSES https://www.khanacademy.org/partner-content
RU
- МАТЕМАТИКА https://ru.khanacademy.org/math
- ЕСТЕСТВЕННЫЕ НАУКИ https://ru.khanacademy.org/science
- ЭКОНОМИКА И ФИНАНСЫ https://ru.khanacademy.org/economics-finance-domain
- ИНФОРМАТИКА https://ru.khanacademy.org/computing
- ИСКУССТВО И ГУМАНИТАРНЫЕ НАУКИ https://ru.khanacademy.org/humanities
Use Case
Only by studying the 'learn' word list from Khan Academy (subtitles/transcripts) can one fully grasp the knowledge by watching the Khan Academy videos, as learning requires review.
Even someone who doesn't know English at all can study the 'learn' word list and then immediately go to Khan Academy to watch the videos and gain knowledge and skills
Benefits
Contribute to global education, especially for regions where the Khan Academy website does not support their native languages, such as Africa. They can learn from the 'learn' word list and then go to Khan Academy to acquire knowledge
Add ScreenShots
Web scraping steps: 'Enter the web scraping category' (EN, RU), go to 1. MATH: HIGH SCHOOL & COLLEGE, and navigate to the second-level directory.
Early math review
> Enter the directoryUnit 1
> Click the play icon, and theVideo transcript
at the bottom of the website is the subtitles
Combine all the subtitles from the chapters under Early math review
into one file, such as Early math review.txt
.
Priority
High
Record
- I have read the Contributing Guidelines
- I'm a GSSOC'24 contributor
- I'm a VSoC'24 contributor
- I have starred the repository