-
Notifications
You must be signed in to change notification settings - Fork 49
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Complete text of BDB? #3
Comments
We have the front matter. It just hasn't made it into the release. The On 12/3/2013 10:22 AM, biblicalhumanities wrote:
|
Hi @DavidTroidl, How much of the text BDB is currently posted in BrownDriverBriggs.xml? Is there a rough estimate of how much remains a work in progress? How can people help with getting it completed? thank you, |
Hi, Brown, Driver, Briggs is a huge work. We have all the entries Peace, David On 2/26/2015 4:11 PM, Razi Shaban wrote:
This email has been checked for viruses by Avast antivirus software. |
Have you given any thought to scraping a website that has the BDB posted? e.g. http://biblehub.com/hebrew/776.htm I'm not sure how the terms of use for the BDB are, but as the BDB is in the public domain, I don't see a reason why scraping the digital version there might not be allowed. The attribution given there is as follows: "Brown-Driver-Briggs Hebrew and English Lexicon, Unabridged, Electronic Database. |
Judging by a quick look at that entry, their database is abridged. I would think that what we have already at least has as much as that one and is unencumbered by their copyright assertions. |
@DavidTroidl this is a wonderful resource! I stumbled across it looking for some lexical information that I was not able to get at through the Accordance UI, and was able to export exactly what I needed using a simple XML parser. I see that "all entries are represented" from your comments above, but I was just wondering if you know for sure if all stems are present for those entries? |
I just came across an entry recently that seemed to need its senses On 3/7/2016 2:14 AM, Laney Stroup wrote:
This email has been checked for viruses by Avast antivirus software. |
http://www.ericlevy.com/Revel/BDB/BDB/main.htm This version of the BDB appears to be complete, although I have seen a few minor errors - numbering of senses being off, in particular. It looks to be parseable, with some effort. |
Wow, that is an impressive piece of work, thanks for the link. I wonder if he would make his source files available. |
From the looks of it, R. Eric Levy copied it from biblecentre.net, which is no longer online. I reached out to R. Levy, but haven't yet heard back. It's relatively easy to download the entire html of the website. Then it's just a small matter of parsing. :) The base text is in the public domain, but some of the emendations here make me wonder if this was digitized from a newer version that someone may try to assert rights over. In any case, the core material is squarely in the public domain, and no one could protest if the core work of the BDB were parsed and redistributed from here. |
I had the BDB from BibleCentre.net years ago, and had done some
significant work with it. Then I deleted everything I had, due to this
post
https://blogs.thegospelcoalition.org/justintaylor/2008/06/24/biblecentrenet-intellectual-property/.
There was no provenance of the data, and it appeared suspect. Certainly
BDB is in the public domain, but someone put extensive work into making
those files, and I personally would not use them without permission.
…On 12/22/2016 2:52 AM, Lev Eliezer Israel wrote:
From the looks of it, R. Eric Levy copied it from biblecentre.net,
which is no longer online. I reached out to R. Levy, but haven't yet
heard back. It's relatively easy to download the entire html of the
website. Then it's just a small matter of parsing. :)
The base text is in the public domain, but some of the emendations
here make me wonder if this was digitized from a newer version that
someone may try to assert rights over. In any case, the core material
is squarely in the public domain, and no one could protest if the core
work of the BDB were parsed and redistributed from here.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#3 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAKwBTQSl7KbtUZhKRX_tqg1Ihqmybozks5rKiwqgaJpZM4BRd9a>.
---
This email has been checked for viruses by Avast antivirus software.
https://www.avast.com/antivirus
|
Thanks, David.
On 12/22/16 8:09 PM, David Troidl
wrote:
I had the BDB from BibleCentre.net years ago, and had
done some
significant work with it. Then I deleted everything I had, due to
this
post
https://blogs.thegospelcoalition.org/justintaylor/2008/06/24/biblecentrenet-intellectual-property/.
There was no provenance of the data, and it appeared suspect.
Certainly
BDB is in the public domain, but someone put extensive work into
making
those files, and I personally would not use them without
permission.
On 12/22/2016 2:52 AM, Lev Eliezer Israel wrote:
From the looks of it, R. Eric Levy copied it from
biblecentre.net,
which is no longer online. I reached out to R. Levy, but
haven't yet
heard back. It's relatively easy to download the entire html
of the
website. Then it's just a small matter of parsing. :)
The base text is in the public domain, but some of the
emendations
here make me wonder if this was digitized from a newer
version that
someone may try to assert rights over. In any case, the core
material
is squarely in the public domain, and no one could protest if
the core
work of the BDB were parsed and redistributed from here.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#3 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAKwBTQSl7KbtUZhKRX_tqg1Ihqmybozks5rKiwqgaJpZM4BRd9a>.
…---
This email has been checked for viruses by Avast antivirus
software.
https://www.avast.com/antivirus
—
You are receiving this because you commented.
Reply to this email directly, view
it on GitHub, or mute
the thread.
{"api_version":"1.0","publisher":{"api_key":"05dde50f1d1a384dd78767c55493e4bb","name":"GitHub"},"entity":{"external_key":"github/openscriptures/HebrewLexicon","title":"openscriptures/HebrewLexicon","subtitle":"GitHub repository","main_image_url":<a class="moz-txt-link-rfc2396E" href="https://cloud.githubusercontent.com/assets/143418/17495839/a5054eac-5d88-11e6-95fc-7290892c7bb5.png">"https://cloud.githubusercontent.com/assets/143418/17495839/a5054eac-5d88-11e6-95fc-7290892c7bb5.png"</a>,"avatar_image_url":<a class="moz-txt-link-rfc2396E" href="https://cloud.githubusercontent.com/assets/143418/15842166/7c72db34-2c0b-11e6-9aed-b52498112777.png">"https://cloud.githubusercontent.com/assets/143418/15842166/7c72db34-2c0b-11e6-9aed-b52498112777.png"</a>,"action":{"name":"Open in GitHub","url":<a class="moz-txt-link-rfc2396E" href="https://github.com/openscriptures/HebrewLexicon">"https://github.com/openscriptures/HebrewLexicon"</
a>}},"updates":{"snippets":[{"icon":"PERSON","message":"@DavidTroidl in #3: I had the BDB from BibleCentre.net years ago, and had done some \nsignificant work with it. Then I deleted everything I had, due to this \npost \nhttps://blogs.thegospelcoalition.org/justintaylor/2008/06/24/biblecentrenet-intellectual-property/. \nThere was no provenance of the data, and it appeared suspect. Certainly \nBDB is in the public domain, but someone put extensive work into making \nthose files, and I personally would not use them without permission.\n\n\nOn 12/22/2016 2:52 AM, Lev Eliezer Israel wrote:\n\u003e\n\u003e From the looks of it, R. Eric Levy copied it from biblecentre.net, \n\u003e which is no longer online. I reached out to R. Levy, but haven't yet \n\u003e heard back. It's relatively easy to download the entire html of the \n\u003e website. Then it's just a small matter of parsing. :)\n\u003e\n\u003e The base text is in the public domain, but some of the emendations \n\u003e here make
me wonder if this was digitized from a newer version that \n\u003e someone may try to assert rights over. In any case, the core material \n\u003e is squarely in the public domain, and no one could protest if the core \n\u003e work of the BDB were parsed and redistributed from here.\n\u003e\n\u003e —\n\u003e You are receiving this because you were mentioned.\n\u003e Reply to this email directly, view it on GitHub \n\u003e \u003chttps://github.com/openscriptures/HebrewLexicon/issues/3#issuecomment-268739997\u003e, \n\u003e or mute the thread \n\u003e \u003chttps://github.com/notifications/unsubscribe-auth/AAKwBTQSl7KbtUZhKRX_tqg1Ihqmybozks5rKiwqgaJpZM4BRd9a\u003e.\n\u003e\n\n\n\n---\nThis email has been checked for viruses by Avast antivirus software.\nhttps://www.avast.com/antivirus\n"}],"action":{"name":"View Issue","url":<a class="moz-txt-link-rfc2396E" href="#3 (comment)">"https://github.com/openscriptures/HebrewLe
xicon/issues/3#issuecomment-268796246"</a>}}}
|
Ah, well that is disappointing. I'm not terribly surprised, though. Do we have any idea who the proper originator of the BDB data is? I'd love to have a conversation with them. Perhaps there's a way we can get it released into the commons legitimately. |
Oooh, best not to mess with that. |
Here's a gift! https://liberalarts.utexas.edu/mes/news/article.php?id=6768 It's a bit rough, the data - it needs to be converted from its current form into proper unicode. There's some node/js code that does some setup, but doesn't go so far as parsing the data. Even so - this seems like a great bounty of data. |
The key map for Bwhebb is at Bible Works Fonts. This should help in constructing a search and replace script for the Hebrew. The consonants appear in reverse order, but each is followed by its vowel: bybia' means אָבִיב |
There is a macro for Word 2003 that converts BibleWorks fonts to unicode. It's in the "OLE and DDE" section of the help file (towards the end: section 58 in BWks 9). It includes this guidance:
I have put the macro itself in a Gist, if that helps. But anyone with BibleWorks (for many versions back) will have this already. |
All, A few things about this data.
|
Here’s the transcoder (it was private, sry). |
@jackweinbender This is great. Thank you. |
FWIW; the JSON file in the transcoder should be exhaustive. Is there a plan to encode this as a TEI document? I’ve also got a simple digital site to display the BDB by page like (http://jastrow.semitics-archive.org), if I can find it. I’ve been playing with some computer vision stuff to split up the images into entries/paragraphs that might make transcription (or perhaps corrected OCR?) easier. |
I’m going to try to keep up with these projects; I’d like to help. I was very disappointed when our NEH grant was not renewed. The BDB is such a fantastic work of scholarship, it is tragic that there isn’t a complete, open, digital edition f it yet. |
@jackweinbender said:
I hope you can find it! That would be valuable, although something the GKC on Wikisource would be remarkable. But please ping me if you mount your digi-BDB! Thanks. |
I will. I’m out of town this week, but i’ll post a link whenever I get it deployed. |
I actually reimplemented my BDB site using the data from this repo's XML file, since the former iteration used the buggy one referenced above. Everything seems to still work, so... as promised: http://bdb.semitics-archive.org/ It probably sucks on mobile, FWIW. |
Nice work! I would like to see the complete text of BDB, including the introduction. Is that something you would consider? Does the answer depend on who does the work?
The text was updated successfully, but these errors were encountered: