![]() The scraping library, that we are going to use will be Beautifulsoup4. Requests is pretty easy to use and straightforward. The networking library we are going to use will be requests. Implement a way to play html5 videos on the Android version of Kiwix (Bug can be found here).Finally: Creating the first prototype zim files.Implementing the search engine in Javascript, that allows the user to search through all of the content.Implementing the local database, that manages all the content.First three days of the second week (24.Writing a python script, that dumps the scraped data into the HTML pages, creating static content.Writing the Scraper, that scrapes the TED translation page on ww.Writing the Scraper, that scrapes TED.com.Creation of a concept including a conceptional Zim file, that demonstrates the very basics of this project.Planning on how this project can be realized.Run zimwriterfs to create the corresponding ZIM file of your target directory.Fill the HTML templates with the data from the XML/RDF and write the index pages in a target directory.Interesting to read this to get an idea how to store a database client side.Create the necessary templates of the index web pages (For the search/filter feature, a javascript client side solution should be tried) with Jinja2. ![]() Subtitle don't make so much sense for TEDx.Retrieve the video subtitle files from.Download videos and re-encode them if necessary.TEDx talks by language are available here.A whole list of the available TED talks is available here (official) or here (unofficial).Retrieve the list of TED(x) presentations with medatas in a local database.It would be best to scrape this site and add the metadata (Category, playlist etc.) by ourselves later on. will give you list of all the TED talks in one place sorted by popularity. gives you a list of all the TED talks sorted by playlists and categories. We should focus on scraping that site, because the old one will eventuelly get discontinued. )Īs it currently stands there is a redesign of the TED site, that is currently available at. The ZIM should provide a simple filtering/search solution to find content (by author, language, title, conference, topic.Videos should be available in HTML5 and subtitles need to be supported.The data should be scraped from ted.com.A script (python) able to create easily following ZIM files of the TED and TEDx videos with the possibility to filter by language/conference/topic.6.7 Templating solution to create pages.6.6 Javascript client side filter/search solution.6.5 Building HTML sites out of the scraped content.Pour nous, un passage à l'utilisation exclusive du format ZIM représente donc non pas une simple intégration à Okawix, mais une refonte totale de notre architecture. À l'heure actuelle Okawix utilise le précurseur du format ZIM : le format Zeno.Ī switch to an exclusive use of the ZIM format would therefore mean much more to us than a mere integration into Okawix, it would mean deep, substantial changes to the architecture of our applications. Okawix currently uses the Zeno format, which has been a precursor to the ZIM format. Lecteurs Kiwix Le lecteur Kiwix vous permet d'accéder aux contenus au format ZIM stockés sur votre téléphone ou votre ordinateur. The Kiwix Reader allows you to access content packages in the ZIM format that are stored on your phone or computer. Kiwix supporte le format ZIM, un format ouvert extrêmement efficace et compressé permettant de stocker des méta-donnés additionnelles. Kiwix supports the ZIM format, a highly compressed open format with additional meta-data. Kiwix utilise le format ZIM qui n'existe que depuis le début de l'année 2009. Kiwix uses the ZIM format which was created recently, in February 2009.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |