0

I am writing a tool in python to analyze the Talmud (e.g. word counts, how many times a specific phrase is used etc.) and I need a free downloadable text file of the Gemara, preferably with Rashi and Tosafos.

Thanks for all your help!!

fartgeek
  • 168
  • 10

1 Answers1

2

Sefaria has all these texts available for free. The easiest way to download them is to go to the Sefaria-Export github repository here. You can download the entire repository as a ZIP by clicking on the green Code button at the top-right, and then choosing "Download ZIP". The files you're looking for will be in the txt folder, then the Talmud folder, then the Bavli folder. If you just want to download the Bavli folder, you might try one of the solutions here.

magicker72
  • 9,904
  • 28
  • 68
  • Thank you very much! – fartgeek Dec 25 '22 at 16:29
  • I have been having trouble with one issue: If a Tosafos starts one one amud, and ends on another, it cuts off at the end of the amud? Do you have a solution? – fartgeek Mar 01 '23 at 01:33
  • @fartgeek A new tosafot starts with a דיבור המתחיל — some words from the gemara followed by a dash or hyphen. You should be able to find all locations that don't start with a few words + hyphen/dash with a regex search. – magicker72 Mar 01 '23 at 03:57