In your opinion, does it make sense to create a new generation of something similar to ChatGPT, which will use databases built solely on the basis of continuously updated data, information, objectively verified knowledge resources taken from online scientific knowledge bases, online scientific portals and online indexing databases of scientific publications?
I'm curious to know what you think about this? This kind of solution based on an intelligent publication search system and an intelligent content analysis system of retrieved publications on an online scientific portal could be of great help to researchers and scientists. In my opinion, the creation of a new generation of something similar to ChatGPT, which will use databases built solely on the basis of online scientific knowledge bases, online scientific portals and online scientific publication indexing databases makes sense if basic issues of copyright respect are met and such tools use continuously updated and objectively and scientifically verified knowledge, data and information resources. With such a solution, researchers and scientists conducting research on a specific topic would have the opportunity to review the literature within the millions of scientific publications collected on specific online scientific portals and scientific publication indexing databases. Besides, what is particularly important, the mentioned partially automated literature review would probably be realized in a relatively short time. Thus, an intelligent system for searching and analyzing the content of scientific publications would, in a short period of time, from among the millions of texts archived in specific scientific publication indexing databases, select those publications in which other researchers and scientists have described analogous, similar, related, correlated, related, etc. issues, results of scientific research conducted, selected publications within the same scientific discipline, the same topic or in the interdisciplinary field. Besides, an intelligent system for searching and analyzing the content of scientific publications could also categorize the retrieved publications into those in which other researchers and scientists confirmed analogous conclusions of conducted similar research, polemicized with the results of other researchers' research on a specific topic, obtained other results from conducted research, suggested other practical applications of obtained research results realized on the same or similar topic, etc. However, for ethical reasons and properly conducted research, i.e., respecting the research results of other researchers and scientists, it would be unacceptable for this kind of intelligent system for searching and analyzing the content of many publications available on specific databases for indexing scientific publications to enable plagiarism, i.e., to provide research results, provide retrieved content on specific issues and topics, etc., without accurately providing the source of the data, description of the source data, names of the authors of the publications, etc., and some unreliable researchers would take advantage of this opportunity. This kind of intelligent system for searching and analyzing the content of scientific publications should give for all searched publications full bibliographic descriptions, source descriptions, footnotes containing all the data that are necessary to develop full source footnotes for possible citation of specific studies, research results, theses, data, etc. contained in other publications written by other researchers and scientists. So, building this kind of intelligent tool would make sense if ChatGPT-type tools were properly improved and the system of laws for their use appropriately supplemented so that the use of ChatGPT-type tools does not violate copyrights and that these tools are used in accordance with ethics and do not generate misinformation. Improving these tools so that they do not generate disinformation, do not create "fictitious facts" in the form of descriptions, essays, photos, videos, etc. containing nicely described, presented never and nowhere seemingly facts is to keep Big Data systems updated, update data sets and information, based on which they create answers to questions, create descriptions, photos, companies and so on. This is important because current online tools like ChatGPT often create "nicely described fictitious facts," which is used to generate fake news and misinformation in online social media. When all that I have written above would be corrected and the use completed, and not only in some parts of the world but on a global scale, then the creation of a new generation of something similar to ChatGPT, which will use databases built solely on the basis of online scientific knowledge bases, online scientific portals and online indexing databases of scientific publications would make sense and could prove helpful to people, including researchers and scientists. Besides, the current online ChatGPT-type tools are not perfect, as they draw data not directly in real-time online from specific databases and knowledge contained in selected websites and portals, but draw information, knowledge, data from an offline database created some time ago. For example, currently the most popular ChatGPT still relies on a database of data, information, etc. contained in many publication texts downloaded from selected websites and web portals but not today or yesterday downloaded only in 2021! So these are data and information already outdated on many issues. Hence the absurdities, inconsistencies with the facts, creation of "fictitious facts" by ChatGPT in a significant part of the answers generated by this system to questions asked by Internet users. In view of the above, in a number of issues, both technological, organizational, formal, normative, etc., such intelligent systems should be improved so that they can be used in open access in the applications I wrote about above.
In view of the above, I address the following question to the esteemed community of scientists and researchers:
In your opinion, does it make sense to create a new generation of something similar to ChatGPT, which will use databases built solely on the basis of continuously updated data, information, objectively verified knowledge resources taken from online scientific knowledge bases, online scientific portals and online indexing databases of scientific publications?
What do you think about creating a new generation of something similar to ChatGPT, which will use exclusively online scientific knowledge resources?
And what is your opinion about it?
What is your opinion on this topic?
Please answer,
I invite everyone to join the discussion,
Thank you very much,
Warm regards,
Dariusz Prokopowicz
Counting on your opinions, on getting to know your personal opinion, on a fair approach to the discussion of scientific issues, I deliberately used the phrase "in your opinion" in the question.
The above text is entirely my own work written by me on the basis of my research.
Copyright by Dariusz Prokopowicz