Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Initial Code Extraction Process #4

Open
erikaz1 opened this issue Sep 5, 2024 · 0 comments
Open

Initial Code Extraction Process #4

erikaz1 opened this issue Sep 5, 2024 · 0 comments

Comments

@erikaz1
Copy link

erikaz1 commented Sep 5, 2024

Hi Thomas, my name is Erika, I am a grad student at UChicago in the States. Your AcademiaOS paper and UI is fascinating and absolutely amazing work.

In the 2023 paper, you write, "First, the system creates initial codes from the raw documents. Initial codes are short text strings describing emergent themes, concepts, and patterns in the language of the raw document" (5). Do you have any idea how exactly GPT chooses its particular output strings/themes? This is an ongoing concern I have regarding Grounded Theory coding since without leading research questions or directions, a text can have a range of key, albeit tangential, themes. This further begs the question, how do we determine the cutoff for this long list of concepts?

I've independently run prompts to GPT turbo/40-mini asking it to explain its initial code selection process. It said it looks for repeated words/phrases, phrases with emphasis, and probably uses context clues to find main ideas. Some of the decisions could not be explained with specificity.

Thanks again for sharing your awesome research.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant