model-mirror

This repo is for the Model in the Mirror project that ran from July 20th to August 25th, 2023. The purpose of this project was to create a Python script that would take an input text, create machine generated text via an LLM, and use an NLP to create a C-test for language learning purposes. The main files within this repo are newgenerator.py, which is the passage generator, any one of the spacy files, which can be used to create the c-tests from a passage input, and the database files where the passages are stored. Overall, the project was a success, but there is much refinement and new angles that can be explored with more time and research (i.e. creating a direct pipeline from database to c-test generation, improving the c-test to exclude certain types of tokens).

But Wait! There's More!

After the project runtime was finished, there has been more work done as a result of me (Zeb) being rehired (hooray!).

What's New

pipeline.py

This is the direct pipeline from database to c-test generation.

spacyfunctions.py

The spacy functions (and most other functions not directly related to prompt generation) are all stored within this file.

Some updates:

testGen is the new primary generator. It's fully paramaterized and can exclude certain tokens from being gapped according to certain attributes. In summary:
1. Token Length!
2. Token Type!
3. Token Frequency!
4. Parts of Speech Included (Y/N)
Can now print 3 versions of the C-test to terminal, file, or directly to H5P content file.
A bunch of language-specific functions for simplicity's sake.
IN PROGRESS: Full-word gapping

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
__pycache__		__pycache__
ctest-databases		ctest-databases
generators		generators
h5pdev		h5pdev
passage-databases		passage-databases
testfiles		testfiles
3.5v4.txt		3.5v4.txt
README.md		README.md
assistantTest.py		assistantTest.py
blah.txt		blah.txt
chinese_ctest1.txt		chinese_ctest1.txt
doc_thoughts.ipynb		doc_thoughts.ipynb
korean_ctest1.txt		korean_ctest1.txt
oldmaterial.txt		oldmaterial.txt
oldsamples.txt		oldsamples.txt
paragraphmaterial.txt		paragraphmaterial.txt
pipeline.py		pipeline.py
portconvert.py		portconvert.py
portuguese_ctest1.txt		portuguese_ctest1.txt
replaceAstrixes.py		replaceAstrixes.py
spacyfunctions.py		spacyfunctions.py
thesaurus.json		thesaurus.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

model-mirror

But Wait! There's More!

What's New

pipeline.py

spacyfunctions.py

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

model-mirror

But Wait! There's More!

What's New

pipeline.py

spacyfunctions.py

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages