NISHIO Hirokazu's Assistant was created and I tried using it.
In summary, the key points of the discussion regarding the use of cut and scanned book data are as follows
These discussions indicate that it is important not only to collect scanned data, but also to structure and summarize the information and develop analysis tools to utilize it for intellectual production. In addition, ways are being explored to take advantage of existing platforms such as Scrapbox, while taking copyright issues into consideration.
What was the discussion about the use of book cutting and scanning data like?
According to Yasukazu Nishio's discussion on the use of cut and scanned book data, it is important not only to scan the data, but also how to connect the data to intellectual production. For example, there is a discussion on how to utilize scanned book data, and the method of extracting and comparing related topics from different books through the process of "deconstructing and reconstructing" is considered effective. This allows readers to gain deeper knowledge and understanding through books (from AI Considerations 2024-04-27).
In addition, when leveraging scanned data, ways are being explored to effectively extract useful information from the data using specific techniques, such as fractal summarization and the development of cross-book analysis tools (from AI Considerations 2024-05-01).
These considerations indicate that there is a need to do more than simply digitize cut and scanned data from books, but to construct new knowledge from it.
Find the page you are considering and let me know how exactly I should implement it.
A discussion of specific applications of cut and scanned book data is described on the "AI Considerations 2024-05-01" page by Yasukazu Nishio. The page suggests the following implementation steps
These steps will help you develop a concrete plan for utilizing book cutting and scanning data and proceed with implementation.
I think I had organized the keywords fromPDF in terms of the amount of information from PDF, but I'll have to search and put it all together.
The page titled "fromPDF" by Yasukazu Nishio describes the process of transferring information from PDF to Scrapbox. The focus here is on how to make information retrieved from PDF available in Scrapbox. The following steps are possible concrete implementation methods:
These steps will allow the information from the PDFs to be efficiently reused within Scrapbox, creating a foundation for knowledge sharing and organization. This process is intended to improve accessibility of information and facilitate knowledge reuse.
There is some question as to whether it is appropriate to have data in Scrapbox format.
There are several considerations by Yasukazu Nishio regarding the advantages and disadvantages of using Scrapbox to hold data. The main points are summarized below.
advantage
drawback
Given these advantages and disadvantages, it is important to carefully consider whether the use of Scrapbox is appropriate for your project and personal needs. Depending on how you use and manage your data, Scrapbox's flexibility and functionality may be a major benefit, while certain limitations may be problematic.
The second advantage, collaborative editing, is becoming difficult to achieve because of book copyrights. However, it is possible to reverse the situation and create a public Scrapbox project using books and classics that are not under copyright. As for the disadvantages, it is difficult to create a hierarchical structure by default. This can be solved by defining a microformat. Flexible modification is an advantage to facilitate trial and error.
You are right, copyright issues are a major challenge when using Scrapbox, but projects using classics and public domain literature whose copyrights have expired are very interesting as an effective solution. By using such literature, we can take full advantage of the collaborative editing features and facilitate the sharing of scholarly discussions and educational content.
Difficulties in handling hierarchical structures in Scrapbox may also be partially resolved by designing appropriate microformats. Microformats allow for consistent structure within a document, while still taking advantage of Scrapbox's flexibility for management.
These approaches are effective in maximizing the advantages of Scrapbox while mitigating its disadvantages. Through trial and error, a more efficient and user-friendly environment for intellectual production can be created.
The hierarchical structure includes the inclusion of chapters and sections in the book itself and the hierarchical structure of the fractal summary. The current fractal summary is engraved without considering the structure, but there is an option to adapt this to the structure of the book. Next, there is the mapping between the terminal information and its summary.
The approach of constructing a fractal summary that reflects the structure of the book is very effective in helping readers understand the content. By clarifying the inclusion of chapters and sections, it becomes easier for the reader to grasp the flow of the entire document and its important points. The following steps can be considered as concrete implementation methods:
Through these methods, book content can be more effectively managed and deployed on platforms such as Scrapbox. This will also improve the quality of intellectual production and create greater value in education and research.
This page is auto-translated from /nishio/書籍の裁断スキャンデータの活用に関する議論 using DeepL. If you looks something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. I'm very happy to spread my thought to non-Japanese readers.